Large Language Model (LLM) Talk

Retrieval Transformer


Listen Later

The sources describe RETRO (Retrieval-Enhanced Transformer), a language model that enhances its performance by retrieving information from a large database. RETRO uses a key-value store where keys are BERT embeddings of text chunks and values are the text chunks themselves. When processing input, it retrieves similar text chunks from the database to augment the input, allowing it to perform comparably to much larger models. By incorporating this retrieved information through a chunked cross-attention mechanism, RETRO reduces the need to memorize facts and improves its performance on knowledge-intensive tasks. The database contains trillions of tokens.

...more
View all episodesView all episodes
Download on the App Store

Large Language Model (LLM) TalkBy AI-Talk

  • 4
  • 4
  • 4
  • 4
  • 4

4

4 ratings


More shows like Large Language Model (LLM) Talk

View all
Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

303 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

341 Listeners

The Daily by The New York Times

The Daily

112,584 Listeners

Learning English from the News by BBC Radio

Learning English from the News

264 Listeners

Thinking in English by Thomas Wilkinson

Thinking in English

110 Listeners

AI Agents: Top Trend of 2025 - by AIAgentStore.ai by AIAgentStore.ai

AI Agents: Top Trend of 2025 - by AIAgentStore.ai

3 Listeners