Vanishing Gradients

Episode 40: What Every LLM Developer Needs to Know About GPUs


Listen Later

Hugo speaks with Charles Frye, Developer Advocate at Modal and someone who really knows GPUs inside and out. If you’re a data scientist, machine learning engineer, AI researcher, or just someone trying to make sense of hardware for LLMs and AI workflows, this episode is for you.

Charles and Hugo dive into the practical side of GPUs—from running inference on large models, to fine-tuning and even training from scratch. They unpack the real pain points developers face, like figuring out:

  • How much VRAM you actually need.
  • Why memory—not compute—ends up being the bottleneck.
  • How to make quick, back-of-the-envelope calculations to size up hardware for your tasks.
  • And where things like fine-tuning, quantization, and retrieval-augmented generation (RAG) fit into the mix.
  • One thing Hugo really appreciate is that Charles and the Modal team recently put together the GPU Glossary—a resource that breaks down GPU internals in a way that’s actually useful for developers. We reference it a few times throughout the episode, so check it out in the show notes below.

    🔧 Charles also does a demo during the episode—some of it is visual, but we talk through the key points so you’ll still get value from the audio. If you’d like to see the demo in action, check out the livestream linked below.

    This is the "Building LLM Applications for Data Scientists and Software Engineers" course that Hugo is teaching with Stefan Krawczyk (ex-StitchFix) in January. Charles is giving a guest lecture at on hardware for LLMs, and Modal is giving all students $1K worth of compute credits (use the code VG25 for $200 off).

    LINKS

    • The livestream on YouTube
    • The GPU Glossary by the Modal team
    • What We’ve Learned From A Year of Building with LLMs by Charles and friends
    • Charles on twitter
    • Hugo on twitter
    • Vanishing Gradients on twitter
    • ...more
      View all episodesView all episodes
      Download on the App Store

      Vanishing GradientsBy Hugo Bowne-Anderson

      • 5
      • 5
      • 5
      • 5
      • 5

      5

      11 ratings


      More shows like Vanishing Gradients

      View all
      a16z Podcast by Andreessen Horowitz

      a16z Podcast

      1,001 Listeners

      Data Skeptic by Kyle Polich

      Data Skeptic

      470 Listeners

      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

      Super Data Science: ML & AI Podcast with Jon Krohn

      296 Listeners

      DataFramed by DataCamp

      DataFramed

      269 Listeners

      Practical AI by Practical AI LLC

      Practical AI

      190 Listeners

      Last Week in AI by Skynet Today

      Last Week in AI

      281 Listeners

      Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

      Machine Learning Street Talk (MLST)

      88 Listeners

      Dwarkesh Podcast by Dwarkesh Patel

      Dwarkesh Podcast

      354 Listeners

      No Priors: Artificial Intelligence | Technology | Startups by Conviction

      No Priors: Artificial Intelligence | Technology | Startups

      125 Listeners

      This Day in AI Podcast by Michael Sharkey, Chris Sharkey

      This Day in AI Podcast

      190 Listeners

      Latent Space: The AI Engineer Podcast by swyx + Alessio

      Latent Space: The AI Engineer Podcast

      63 Listeners

      The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

      The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

      424 Listeners

      The Next Wave - AI and The Future of Technology by Hubspot Media

      The Next Wave - AI and The Future of Technology

      57 Listeners

      Training Data by Sequoia Capital

      Training Data

      36 Listeners

      High Signal: Data Science | Career | AI by Delphina

      High Signal: Data Science | Career | AI

      4 Listeners