Large Language Model (LLM) Talk

GPT-3



GPT-3 is a large language model that generates text one token at a time, conditioned on the input text it is given. It was trained on a dataset of about 300 billion tokens and stores what it learned in 175 billion parameters. Its context window is 2048 tokens. Most of the computation happens in a stack of 96 transformer decoder layers, each holding roughly 1.8 billion parameters. Input words are converted to vectors, the layer stack computes a prediction, and the result is converted back to a word. Each generated word is then appended to the input and fed back through the stack to produce the next one. Priming examples (a few demonstrations of the task) can be included in the input to steer the output, and fine-tuning updates the model's weights to improve performance on a specific task.
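The generation loop described above can be sketched in a few lines of Python. This is a toy illustration, not GPT-3 itself: `TOY_NEXT_TOKEN`, `predict_next`, `generate`, and `params_per_layer` are hypothetical names, and a small lookup table stands in for the 96-layer decoder stack. The per-layer figure is a rough even split of the total parameter count, which ignores embeddings and other non-layer parameters.

```python
# Toy sketch of autoregressive generation. A lookup table stands in for
# the trained model; every name here is hypothetical.

CONTEXT_WINDOW = 2048  # context length in tokens, per the episode summary

# Assumption: the "model" only looks at the last token. A real model
# attends to the whole context.
TOY_NEXT_TOKEN = {
    "the": "robot",
    "robot": "must",
    "must": "obey",
    "obey": "<end>",
}


def predict_next(context):
    """Return the next token for the visible context (toy rule)."""
    if not context:
        return "<end>"
    return TOY_NEXT_TOKEN.get(context[-1], "<end>")


def generate(prompt_tokens, max_new_tokens=10, context_window=CONTEXT_WINDOW):
    """Produce one token at a time, feeding each output back in as input."""
    tokens = list(prompt_tokens)
    for _ in range(max_new_tokens):
        # Only the most recent tokens fit in the context window.
        context = tokens[-context_window:]
        next_token = predict_next(context)
        if next_token == "<end>":
            break
        tokens.append(next_token)
    return tokens


def params_per_layer(total_params=175e9, num_layers=96):
    """Rough even split of parameters across the decoder layers."""
    return total_params / num_layers


print(generate(["the"]))  # ['the', 'robot', 'must', 'obey']
print(round(params_per_layer() / 1e9, 2))  # 1.82 (billions per layer)
```

Priming examples fit the same loop: they are simply extra tokens placed at the start of `prompt_tokens`, so they share the context window with the actual request.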


Large Language Model (LLM) Talk by AI-Talk


