November 25, 2024

Exploring RAG Pipelines with Private AI Foundation and NVIDIA

Listen Later

19 minutes

In this episode of the Virtually Speaking Podcast, we delve into the world of AI with Justin Murray, Product Marketing Engineer, and Frank Denneman, Chief Technologist for AI at Broadcom. We discuss retrieval augmented generation (RAG), a powerful approach that combines large language models with real-time, trusted data. Learn how RAG pipelines can be architected using Private AI Foundation with NVIDIA, including insights into key components like LLMs, NVIDIA Inference Microservices, and Vector DB. We also explore best practices for GPU sizing and when to use fractional or multiple GPUs for optimal performance. Join us for this fascinating conversation!

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Virtually Speaking Podcast

By Virtually Speaking Podcast

4.7

4242 ratings

November 25, 2024

Exploring RAG Pipelines with Private AI Foundation and NVIDIA

Listen Later

19 minutes

In this episode of the Virtually Speaking Podcast, we delve into the world of AI with Justin Murray, Product Marketing Engineer, and Frank Denneman, Chief Technologist for AI at Broadcom. We discuss retrieval augmented generation (RAG), a powerful approach that combines large language models with real-time, trusted data. Learn how RAG pipelines can be architected using Private AI Foundation with NVIDIA, including insights into key components like LLMs, NVIDIA Inference Microservices, and Vector DB. We also explore best practices for GPU sizing and when to use fractional or multiple GPUs for optimal performance. Join us for this fascinating conversation!

...more

More shows like Virtually Speaking Podcast

Intelligent Machines (Audio) by TWiT

Intelligent Machines (Audio)

780 Listeners

Risky Business by Risky Business Media

Risky Business

375 Listeners

Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

Kubernetes Podcast from Google

180 Listeners

Cybersecurity Headlines by CISO Series

Cybersecurity Headlines

137 Listeners

Unexplored Territory by Duncan Epping

Unexplored Territory

12 Listeners