FLaNK Weekly for 12 February 2024
12-February-2024
FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
Trial: https://console.us-west-1.cdp.cloudera.com/trial/register.html#/
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #124**
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
Qualified Developers (Looking for work)
https://www.linkedin.com/in/satya-n99999/
Courses
https://cs50.harvard.edu/python/2022/
https://developers.google.com/machine-learning/crash-course
https://cognitiveclass.ai/courses/docker-essentials
https://www.scaler.com/topics/course/java-beginners/
https://online.stanford.edu/courses/soe-ydatabases0005-databases-relational-databases-and-sql
Articles
Open Source Data Infrastructure Meetup - Feb 2024 https://medium.com/@tspann/open-source-data-infrastructure-meetup-feb-2024-9e8048666828
Apache NiFi with Amazon Translate for ML Pipelines https://medium.com/@tspann/apache-nifi-and-amazon-translate-for-machine-learning-pipelines-dcfe4e61fc02
NiFi 2.0.0-M2 is Out! https://medium.com/@tspann/apache-nifi-2-0-0-m2-out-314a1d4c8b20
Apache NiFi Python extensions https://apex974.com/articles/nifi-2-python-extensions
Apache NiFi and Amazon Textract for Machine Learning https://medium.com/@tspann/apache-nifi-and-amazon-textract-for-machine-learning-e45f4af12e68
Using the Apache NiFi API to Start and Stop NiFi Processors https://www.clearpeaks.com/using-the-nifi-api-to-start-and-stop-nifi-processors-from-a-nifi-flow/
LangChain from 0 to 1 https://fosdem.org/2024/events/attachments/fosdem-2024-2384-langchain-from-0-to-1-unveiling-the-power-of-llm-programming/slides/21698/LangChain_From_0_To_1_public_1_PpuSgEN.pdf
Local First AI with Postgres PgVector https://electric-sql.com/blog/2024/02/05/local-first-ai-with-tauri-postgres-pgvector-llama
RAG Details https://www.infoq.com/podcasts/retrieval-augmented-generation
ChromaDB in Java (LangChain4J) https://medium.com/@timju/chromadb-in-java-langchain4j-41ed910cd3e7
SQL Tutorial https://gvwilson.github.io/sql-tutorial
Gen AI to Edge https://developer.nvidia.com/blog/bringing-generative-ai-to-the-edge-with-nvidia-metropolis-microservices-for-jetson/
I want to know what small device can run Java for IoT and fit in a toothbrush. I want to develop stuff for that platform. https://www.zdnet.com/home-and-office/smart-home/3-million-smart-toothbrushes-were-just-used-in-a-ddos-attack-really/
https://www.unlogged.io/post/springboot-vs-quarkus-vs-micronaut
https://ionutbalosin.com/2024/02/jvm-performance-comparison-for-jdk-21/
https://thewritetoroam.com/2024/02/how-to-write-stuff-no-one-else-can
https://www.morling.dev/blog/filtering-process-output-with-tee/
Videos
Seven Videos on Real-Time Streaming https://medium.com/@tspann/seven-videos-on-real-time-streaming-02711320afa8
Unlocking Financial Data with Real-Time Pipelines (OSACon 2023) https://www.youtube.com/watch?v=Q7gF7m4yFi4&ab_channel=OSACon
Tips
February 8, 2024 Meetup
Events
Feb 20, 2024: 12-1PM EST. Virtual. Azure Data Tech Groups: DBA Fundamentals Group https://www.meetup.com/dba-fundamentals-group/events/296855261/
Feb 28, 2024: NYC. Cloudera Meetup. Flink https://www.meetup.com/futureofdata-princeton/events/298661947/
Feb 29, 2024: Virtual. Conf42 Python. https://www.conf42.com/Python_2024_Tim_Spann_apache_nifi_2_processors https://www.conf42.com/Python_2024_Karin_Wolok_nifi__kafka_risingwave_iceberg_llm
March 5, 2024: Princeton. Meetup. GenAI. https://www.meetup.com/applied-generative-artificial-intelligence-applications/
March 15, 2024: TCF Pro. Princeton, NJ. IEEE Information Technology Professional Conference at the Trenton Computer Festival. https://princetonacm.acm.org/tcfpro/
April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
Code
- https://github.com/tspannhw/FLaNK-python-watsonx-processor
- https://github.com/kevinbtalbert/CML_AMP_NeMo-Guardrails-Chatbot
Models
Data
- https://github.com/trending/python
- https://github.com/trending/java?since=daily
- https://github.com/josephmisiti/awesome-machine-learning
- https://oceanservice.noaa.gov/education/tutorial_currents/04currents3.html
- https://water.weather.gov/ahps2/hydrograph.php?gage=BKWN4&wfo=phi
Tools
- https://github.com/rsrohan99/llamabot
- https://github.com/google-deepmind/graphcast
- https://open-meteo.com/en/docs/ensemble-api
- https://github.com/Stell0/fosdem2024/
- https://github.com/cfahlgren1/natural-sql
- https://github.com/electric-sql/electric/tree/tauri-example-postgres/examples/tauri-postgres
- https://github.com/QwenLM/Qwen1.5
- https://github.com/nomic-ai/contrastors
- https://checkip.amazonaws.com/
- https://github.com/dvcoolarun/web2pdf
- https://github.com/narfindustries/http-garden
- https://github.com/kevingduck/ChatGPT-phone
- https://github.com/SuperDuperDB/superduperdb
- https://github.com/allegroai/clearml
- https://github.com/mosaicml/streaming
- https://github.com/Sanster/IOPaint
- https://github.com/ververica/flink-cdc-connectors
- https://github.com/grpc/grpc-java
- https://github.com/in28minutes/master-spring-and-spring-boot
- https://github.com/dapr/quickstarts/tree/master/tutorials/bindings
- https://github.com/TheoKanning/openai-java
- https://github.com/second-opinion-ai/second-opinion
- https://rustpython.github.io/
- https://github.com/kevinbtalbert/Healthcare-Demo
- https://www.promptfoo.dev/docs/guides/mistral-vs-llama/
- https://github.com/allenai/olmo
- https://aitestkitchen.withgoogle.com/tools/image-fx
- https://tabulator.info/docs/5.5/quickstart
- https://app.getonboardai.com/chat/ike71usxhomcsoaalu3mi?repo=github%3A%3Alangchain-ai%2Flangchain
- https://github.com/osrd-project/osrd
- https://unstructured-io.github.io/unstructured/installation/full_installation.html
- https://faiss.ai/
- https://github.com/langchain4j/langchain4j-examples
- https://thenewstack.io/linux-hide-your-shell-passwords-with-sshpass
- https://github.com/CorentinJ/Real-Time-Voice-Cloning
- https://github.com/neonbjb/tortoise-tts
- https://github.com/coqui-ai/TTS
- https://github.com/netease-youdao/EmotiVoice
- https://github.com/lobehub/lobe-chat
- https://github.com/FlowiseAI/Flowise
- https://github.com/f/awesome-chatgpt-prompts
- https://github.com/labring/FastGPT
- https://github.com/eosphoros-ai/Awesome-Text2SQL
- https://github.com/vanna-ai/vanna
- https://github.com/sqlchat/sqlchat
- https://github.com/chat2db/chat2db
- https://github.com/Plachtaa/VALL-E-X
- https://zellij.dev/screencasts/
- https://github.com/zilliztech/attu
- https://www.textualize.io/
- https://getdeploying.com/reference/data-egress
- https://github.com/ckampfe/jindex
- https://github.com/openvinotoolkit/openvino_notebooks
- ggerganov/llama.cpp#4167
- https://lilianweng.github.io/posts/2023-10-25-adv-attack-llm/
- https://github.com/InterLinked1/lbbs
- https://www.mysticbbs.com/downloads.html
- https://wiki.synchro.net/howto:raspbian_install
- https://wiki.seeedstudio.com/Local_Voice_Chatbot/
- https://learnopencv.com/yolo-loss-function-gfl-vfl-loss/
- https://huggingface.co/blog/tgi-messages-api
- https://github.com/7mind/sick
- https://github.com/adamritter/fastgron
- https://github.com/Textualize/toolong
Interesting
Automate YouTube Shorts Creation https://github.com/FujiwaraChoki/MoneyPrinter/tree/main
© 2020-2024 Tim Spann
FLaNK Stack Weekly for 09 Oct 2023
FLiPN-FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #106**
https://github.com/tspannhw/FLiPStackWeekly
https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
https://www.cloudera.com/solutions/dim-developer.html
Flink got added to OSS Chat! https://osschat.io/chat?project=Flink
Halifax Community over Code was great. Lots of cool people and projects. LLM with Tika and OpenNLP and LangStream. Pulsar, Kafka, NiFi, Iceberg, Ozone, Calcite, HBase, Hive and more had great sessions.
Articles
https://dzone.com/articles/real-time-analytics-1
DZone Data Pipeline Trend Report https://dzone.com/trendreports/data-pipelines-2
https://cldr-steven-matison.github.io/blog/SSB-Iceberg-Demo/
https://www.linkedin.com/pulse/apache-pulsar-trailblazer-validated-recent-kafka-david-kjerrumgaard/
https://dl.acm.org/doi/abs/10.1145/3597060.3597237
https://www.ibm.com/blog/watsonx-tailored-generative-ai/
https://www.ibm.com/downloads/cas/X9W4O6BM
https://www.wired.com/story/heisse-preise-food-prices/
https://www.hackster.io/shahizat/how-to-run-a-chatgpt-like-llm-on-nvidia-jetson-board-41fd79
https://www.hackster.io/nickbild/voicegpt-f88f8f
https://annas-blog.org/worldcat-scrape.html
https://benchmark.vectorview.ai/vectordbs.html
https://www.jesse-anderson.com/2023/10/current-2023-announcements/
https://jack-vanlightly.com/blog/2023/10/10/a-primer-on-formal-verification-and-tla
https://towardsdatascience.com/forget-rag-the-future-is-rag-fusion-1147298d8ad1
https://github.com/Leantime/leantime
Videos
https://www.youtube.com/watch?v=8cZJ9CyLYyI&ab_channel=Cloudera%2CInc.
https://www.youtube.com/shorts/0oJ1TM-H52s
https://www.youtube.com/watch?v=ROy4b_-w-Iw
https://www.youtube.com/shorts/EyfR4hOSMA0
Events
October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html
October 26, 2023: Future of Data NYC. Meetup. Hybrid. https://www.meetup.com/futureofdata-newyork/events/295516928/
October 26, 2023: Cloudera Now EMEA. Virtual. https://www.cloudera.com/about/events/cloudera-now-cdp/emea.html
November 1, 2023: Open Source Finance Forum. Virtual. https://events.linuxfoundation.org/open-source-finance-forum-new-york/
November 1, 2023 7PM EST: AI Dev World. Hybrid https://aidevworld.com/conference/
November 2, 2023: Evolve. NYC https://www.cloudera.com/about/events/evolve/new-york.html#register
November 7, 2023: XtremeJ 2023. Virtual. https://xtremej.dev/2023/schedule/
November 8, 2023: Flink Forward, Seattle. https://www.flink-forward.org/seattle-2023
November 21, 2023: JCon World. Virtual. https://sched.co/1RRWm
November 22, 2023: Big Data Conference. Hybrid. https://bigdataconference.eu/ https://events.pinetool.ai/3079/#sessions/101077
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
Code
Tools
- https://github.com/badlogic/heissepreise
- https://extism.org/
- https://ambient.run/
- https://github.com/microsoft/autogen
- https://www.youtube.com/watch?v=RPz7Xm4fLF4
- https://github.com/lm-sys/FastChat
- https://github.com/cado-security/cloudgrep
- https://www.scylladb.com/2023/10/02/introducing-database-performance-at-scale-a-free-open-source-book/
- https://github.com/ublue-os/obs-studio-portable
- https://distrobox.it/
- https://github.com/SongweiGe/rich-text-to-image
© 2020-2023 Tim Spann