FLaNK Stack Weekly for 22 January 2024
22-January-2024
FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #121 **
https://github.com/tspannhw/FLiPStackWeekly
https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
https://www.cloudera.com/solutions/dim-developer.html
Articles
Writing A Gen AI Processor with Python https://medium.com/@tspann/writing-a-generative-ai-python-processor-ed0655cf4e3f
Codeless Generative AI Pipelines with Chroma Vector DB & Apache NiFi https://medium.com/@tspann/codeless-generative-ai-pipelines-with-chroma-vector-db-apache-nifi-43e77d75952f
Using NiFi to Augment and Enrich LLM Results with Real-Time Contextual Data https://medium.com/@tspann/augmenting-and-enriching-llm-with-real-time-context-b6da7ba4960a
Web AI Testing with Chrome https://developer.chrome.com/blog/supercharge-web-ai-testing
What is TinyML? https://www.ikkaro.net/what-tinyml-is/
Watch that DNS https://rmoff.net/2024/01/16/hosting-on-github-pages-watch-out-for-subdomain-hijacking/
Which Gen AI to Use? https://artificialanalysis.ai/
Implementing RAG with HuggingFace https://medium.com/international-school-of-ai-data-science/implementing-rag-with-langchain-and-hugging-face-28e3ea66c5f7
Kafka on K8 https://engineering.grab.com/kafka-on-kubernetes?
Fix Busted PiP https://medium.com/@RyanHiebert/how-i-fixed-a-pip-compile-dependency-resolution-error-c09305e107e2
NiFi in Kafka Connect https://www.cloudera.com/content/dam/www/marketing/resources/webinars/emea-how-to-run-nifi-flows-in-kafka-kconnect.png.landing.html
Redhat with Cloudera for Generative AI https://www.redhat.com/en/blog/unlocking-power-generative-ai-cloudera-data-platform-and-red-hat-openshift
Videos
Unlocking Financial Data with Real-Time Pipelines (OSACon 2023) https://www.youtube.com/watch?v=Q7gF7m4yFi4&ab_channel=OSACon
Auto Generate NiFi Flows from Natural Language by Mark Payne https://www.youtube.com/watch?v=3oRnUdE7x7w
Looking at the New Features of Apache NiFi (Halifax Community over Code) https://www.youtube.com/watch?v=_orD9aAXk48&ab_channel=TheASF
Utilizing Real-Time Transit Data for Travel Optimization (Halifax Community over Code) Sunday Oct 8 2023, Canada https://www.youtube.com/watch?v=OWQmeF-UeEc&ab_channel=TheASF
Continuous SQL with Kafka and Flink | Timothy Spann (EN) https://www.youtube.com/watch?v=IGs0k240zhU&ab_channel=JAVAPRO
Events
Open Source Finance Forum. Virtual. https://resources.finos.org/znglist/osff-2023-virtual-presentations/?c=cG9zdDo5OTEzOTk%3D&utm_campaign=OSFF+NYC+2023&utm_content=269713979&utm_medium=social&utm_source=linkedin&hss_channel=lcp-18473937
Feb 8, 2024: NYC.
https://www.meetup.com/new-york-open-source-data-infrastructure-meetup/events/297484047/
18:00 - 18:30 Welcome: Networking & snacks 18:30 - 18:35 Kickoff: Welcome Aiven 18:35 - 19:00 A Guide to Product Experimentation (Erin Mikail Staples, LaunchDarkly) 19:00 - 19:30 Building Real-time Pipelines: A Case Study with Transit Data (Tim Spann, Cloudera) 19:30 ~ 21:00 Food & networking
Feb 2024: Webinar
Feb 28, 2024: NYC. Cloudera Meetup. Flink https://www.meetup.com/futureofdata-princeton/events/298661947/
March 15, 2024: Princeton. IT Professional Conference at Trenton Computer Festival IEEE Information Technology Professional Conference on Friday, March 15th, 2024 https://princetonacm.acm.org/tcfpro/
April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
Code
- https://github.com/tspannhw/FLaNK-python-watsonx-processor
- https://github.com/tspannhw/FLaNK-CDW
- https://github.com/tspannhw/FLaNK-VectorDB
- https://github.com/tspannhw/FLaNK-RPI5
- https://github.com/tspannhw/FLaNK-EdgeAI
- https://github.com/kevinbtalbert/NiFi-Flows-Demos
- https://github.com/DataSQRL/apirag
Models
- https://github.com/apple/ml-ferret
- https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF
- https://github.com/kevinbtalbert/Electric_and_Utilities_System_Demo
- https://stability.ai/news/stable-code-2024-llm-code-completion-release
- https://clay-foundation.github.io/model/
- https://github.com/speechbrain/speechbrain
- https://huggingface.co/thenlper/gte-large
- https://github.com/SeanLee97/AnglE
- https://huggingface.co/WhereIsAI/UAE-Large-V1
- https://huggingface.co/stabilityai/stablelm-2-1_6b
- https://github.com/jzhang38/TinyLlama
- https://huggingface.co/tiiuae/falcon-7b
Tools
- https://github.com/timfraedrich/OutRun
- https://projectnessie.org/
- https://github.com/KRTirtho/spotube
- https://textart.sh/
- https://developer.spotify.com/documentation/web-api
- https://nightshade.cs.uchicago.edu/downloads.html
- https://barkeywolf.consulting/posts/barcode-scanner-webassembly/#meet-zbar
- https://github.com/kffl/speedbump
- https://github.com/stevekrenzel/pick-ems
- https://tratt.net/laurie/blog/2024/faster_shell_startup_with_shell_switching.html
- https://github.com/polyzos/stream-processing-with-apache-flink
- https://gptcache.readthedocs.io/en/latest/bootcamp/langchain/qa_generation.html
- https://jliljebl.github.io/flowblade/index.html
- https://willowprotocol.org/
- https://nitro.unjs.io/
- https://github.com/OPCFoundation/UA-EdgeTranslator
- https://www.open62541.org/
- https://pypi.org/project/pinecone-client/
- https://www.plotteus.dev/
- https://github.com/serversideup/spin
- https://github.com/Portkey-AI/gateway
- https://maven.apache.org/docs/4.0.0-alpha-12/release-notes.html
- https://github.com/openremote/openremote
- https://github.com/fugue-project/fugue
- https://github.com/apache/flink-kubernetes-operator
- https://github.com/dai-shi/excalidraw-claymate
- https://github.com/whylabs/langkit
- https://github.com/clastix/kamaji
- https://github.com/milvus-io/bootcamp
- https://github.com/deepset-ai/haystack-cookbook
- https://github.com/sgl-project/sglang
- https://github.com/georgevetticaden/evernote-ai-chatbot
- https://github.com/IBM/watsonxdata-python-sdk
- https://mermaid.live/
- https://github.com/dennislee22/deepspeed-train-CML
- https://github.com/gabrielchua/RAGxplorer
- https://chromeenterprise.google/os/chromeosflex/
© 2020-2024 Tim Spann