April 12, 2026

Understanding RAG Systems

28 minutes

SUMMARY: The RAG (Retrieval Augmented Generation) pattern is one of the most frequently used ways to augment LLMs with context-specific information. Let’s explore RAG.

GUEST: Roie Schwaber-Cohen, Head of Developer Relations at Pinecone

SHOW: 1018

SHOW TRANSCRIPT: The Reasoning Show #1018 Transcript

SHOW VIDEO: https://youtu.be/-kZZEMR341Q

SHOW SPONSORS:

Nasuni - Activate your data for AI and request a demo
ShareGate - ShareGate Protect. Microsoft 365 Governance, we got this!

SHOW NOTES:

Topic 1 - Welcome to the show. Tell us a little bit about your background and what you focus on these days at Pinecone.

Topic 2 - Let’s begin by talking about RAG systems. What are they? Why do companies choose to use them? What benefits do they provide in AI systems?

Topic 3 - At a high level, RAG sounds straightforward—retrieve relevant context, generate an answer. But in practice, where does it break first as systems scale?

Topic 4 - I’ve heard that RAG systems can return answers that are technically correct but fundamentally wrong. What’s a concrete example of that happening in production—and why does it slip past most teams?

Topic 5 - In traditional systems, we assume there’s a single source of truth. But in enterprise environments, ‘truth’ is often versioned, contextual, and conflicting. How should teams rethink ‘truth’ when building AI systems?

Topic 6 - A lot of teams assume their knowledge base is ‘good enough’ for RAG. What do they usually underestimate about the messiness of real enterprise data?

Topic 7 - There’s a growing narrative that better reasoning models can compensate for weaker retrieval. From what you’ve seen, where does that idea fall apart?

Topic 8 - If correctness depends on things like timing, policy scope, or configuration, how should teams design systems that understand context—not just content?

Topic 9 - Looking ahead, what replaces today’s RAG architectures? What patterns are emerging among teams that are actually getting this right?
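
For listeners new to the pattern, the "retrieve relevant context, generate an answer" flow from Topic 3 can be sketched in a few lines. This is a toy illustration and not anything taken from the episode: the corpus, the function names (`retrieve`, `build_prompt`), and the bag-of-words cosine scoring are all assumptions standing in for a real embedding model and vector database, and the final LLM call is left out.

```python
import math
import re
from collections import Counter

# Toy corpus; these documents are invented purely for illustration.
DOCS = [
    "Refunds are available within 30 days of purchase.",
    "Standard shipping takes 5 to 7 business days.",
    "Support is open Monday through Friday.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, k=1):
    """Retrieval step: rank documents by similarity to the query, keep the top k."""
    q = Counter(tokenize(query))
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
    return ranked[:k]


def build_prompt(query, context):
    """Augmentation step: place the retrieved text in front of the question."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"


query = "Are refunds available after purchase?"
context = retrieve(query, DOCS, k=1)
prompt = build_prompt(query, context)
print(prompt)  # In a real system this prompt would be sent to an LLM.
```

Even in this small sketch, answer quality depends entirely on what the retrieval step returns, which is the concern several of the topics above circle around.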

FEEDBACK?

Email: show @ reasoning dot show
Bluesky: @reasoningshow.bsky.social
Twitter/X: @ReasoningShow
Instagram: @reasoningshow
TikTok: @reasoningshow

The Reasoning Show

By Massive Studios

4.6 (147 ratings)
