Diaries of a Data Scientist

#22 David vs. Goliath: Open Source Takes on Generative AI Giants - DODS



Welcome back to Episode 22! 🎙

Have you ever wondered how much control you truly have over your Gen AI models? What about the protection of your data? 🤔

Episode #22 of “Diaries of a Data Scientist” explores a David vs. Goliath-like story: open source taking on the generative AI giants.

Why should you care about models you can run locally or in your own private cloud?

What if you could avoid paying a premium on every token processed?

And how valuable is a global community that constantly improves and innovates on these models?

We also cover relevant open-source datasets and models for text-to-text and text-to-image generation. If you're ready to extend your horizon beyond the standard Gen AI providers, this episode is for you!

🪽 Follow Jasmin on LinkedIn: https://www.linkedin.com/in/jasmin-weimueller-bsc2018/

🪽 Follow Kate on LinkedIn: https://www.linkedin.com/in/kate-nazarova-data-science/

🪽 Subscribe to our official DODS page: https://www.linkedin.com/company/diaries-of-data-scientist/

Follow us on Medium 👇

🖇 Jasmin’s Medium page: https://medium.com/@JasminWhy

🖇 Kate’s Medium page: https://medium.com/@Kate_in_DS

Join us on other platforms:

🎧 Spotify: https://open.spotify.com/show/1DAelRe22W8vBHK7rTU361?si=4e4f3d7bc67546cc

🎧 Apple: https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://podcasts.apple.com/us/podcast/diaries-of-a-data-scientist/id1710961657&ved=2ahUKEwjhm8PdhMWIAxV6qZUCHYZsCDwQFnoECBsQAQ&usg=AOvVaw1deaPC2MF6aWM69-SKSRoH

🎧 Amazon: https://amzn.asia/d/7J3UkTE

🎧 Podimo: https://podimo.com/de/shows/diaries-of-a-data-scientist

🎧 Podscribe: https://app.podscribe.ai/series/2353052

Useful links & Resources:

State of Open Source AI: https://github.blog/news-insights/research/the-state-of-open-source-and-ai/

LLaMA: https://about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/

LAION-5B: https://huggingface.co/datasets/danielz01/laion-5b

The Pile: https://huggingface.co/datasets/EleutherAI/pile

C4: https://huggingface.co/datasets/legacy-datasets/c4

GPT-Neo / GPT-J: https://huggingface.co/docs/transformers/en/model_doc/gpt_neo; https://huggingface.co/docs/transformers/en/model_doc/gptj

Mixtral 8x7B: https://huggingface.co/mistralai/Mixtral-8x7B-v0.1

BLOOM: https://bigscience.huggingface.co/blog/bloom

T5: https://huggingface.co/docs/transformers/en/model_doc/t5

LLaMA: https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/

Stable Diffusion: https://huggingface.co/models?other=stable-diffusion

DALL-E Mini: https://wandb.ai/dalle-mini/dalle-mini/reports/DALL-E-Mini-Explained--Vmlldzo4NjIxODA; https://github.com/borisdayma/dalle-mini

FLUX.1: https://www.bentoml.com/blog/a-guide-to-open-source-image-generation-models


Diaries of a Data Scientist, by Jasmin and Kate