Weaviate Podcast

Patrick Lewis on Retrieval-Augmented Generation - Weaviate Podcast #76!



Hey everyone, I am SUPER excited to present our 76th Weaviate Podcast featuring Patrick Lewis, an NLP Research Scientist at Cohere! Patrick has had an absolutely massive impact on Natural Language Processing with AI and Deep Learning! Especially notable for the current climate in AI and Weaviate is that Patrick is the lead author of the original "Retrieval-Augmented Generation" paper! Patrick has also contributed to many other profoundly impactful papers in the space, such as DPR, Atlas, and Task-Aware Retrieval with Instructions! This was such an illuminating conversation. Here is a quick overview of the chapters in the podcast!

1. Origin of RAG - Patrick explains the build-up that led to the RAG paper: AskJeeves, IBM Watson, and the conceptual shift to retrieve-read in mainstream connectionist approaches to AI.
2. Atlas - Atlas shows that a much smaller LLM, when paired with retrieval augmentation, can still achieve competitive few-shot and zero-shot task performance. This is super impactful because few-shot and zero-shot capability has been a massive evangelist for AI broadly, and the fact that smaller retrieval-augmented models can do this is massive for economically unlocking these applications.
Teasing apart some architectural details of RAG:
3. Fusion In-Decoder - Interesting encoder-decoder transformer design in which each document is paired with the query and encoded separately; the encoded representations are then concatenated and passed to the decoder.
4. End-to-End RAG - How to think about jointly training an embedding model and an LLM augmented with retrieval?
5. Query Routers - How to route queries between, say, SQL and Vector DBs? (More nuance on this later with Multi-Index Retrieval)
6. ConcurrentQA - Super interesting work on the privacy of multi-index routers. For example, if you ask "Who is the father of our new CEO?", answering the first part from a private index and then sending a follow-up query about the father to a public index may reveal private information about the new CEO.
7. Multi-Index Retrieval
8. New APIs for LLMs
9. Self-Instructed Gorillas
10. Task-Aware Retrieval with Instructions
11. Editing Text, EditEval and PEER
12. What future direction excites you the most?
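
The Fusion In-Decoder data flow described in chapter 3 can be sketched roughly as follows. This is a toy illustration only, not the implementation from the paper: `toy_encode` is a hypothetical stand-in for a transformer encoder, and the `question:` / `context:` prompt format and the example passages are assumptions made for the sketch.

```python
# Toy sketch of the Fusion In-Decoder data flow (illustrative, not the paper's code).

def toy_encode(text: str) -> list[list[float]]:
    """Hypothetical encoder: map each whitespace token to a tiny 2-d 'hidden state'."""
    return [[float(len(tok)), float(sum(map(ord, tok)) % 7)] for tok in text.split()]

def fusion_in_decoder_inputs(query: str, passages: list[str]) -> list[list[float]]:
    """Encode each (query, passage) pair independently, then concatenate the
    encoder outputs along the sequence axis for a decoder to attend over."""
    fused: list[list[float]] = []
    for passage in passages:
        pair = f"question: {query} context: {passage}"  # assumed prompt format
        fused.extend(toy_encode(pair))  # each pair is encoded without seeing other passages
    return fused

query = "who wrote RAG"
passages = ["Patrick Lewis is lead author", "Atlas is a retrieval model"]
states = fusion_in_decoder_inputs(query, passages)
print(len(states))  # one hidden state per token across all encoded pairs
```

Because each pair is encoded on its own, encoder cost grows linearly with the number of retrieved passages, while the decoder still sees all of them at once through the concatenated states.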
Links:
Learn more about Patrick Lewis: https://www.patricklewis.io/
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks: https://arxiv.org/abs/2005.11401
Atlas: https://arxiv.org/pdf/2208.03299.pdf
Fusion In-Decoder: https://arxiv.org/pdf/2007.01282.pdf
Chapters
0:00 Welcome Patrick Lewis!
0:36 Origin of RAG
5:20 Atlas
10:43 Fusion In-Decoder
17:50 End-to-End RAG
27:05 Query Routers
32:05 ConcurrentQA
37:30 Multi-Index Retrieval
40:05 New APIs for LLMs
41:50 Self-Instructed Gorillas
44:35 Task-Aware Retrieval with Instructions
52:00 Editing Text, EditEval and PEER
55:35 What future direction excites you the most?


Weaviate Podcast by Weaviate


