Vanishing Gradients

Episode 50: A Field Guide to Rapidly Improving AI Products -- With Hamel Husain



If we want AI systems that actually work, we need to get much better at evaluating them, not just building more pipelines, agents, and frameworks.

In this episode, Hugo talks with Hamel Husain (ex-Airbnb, GitHub, DataRobot) about how teams can improve AI products by focusing on error analysis, data inspection, and systematic iteration. The conversation is based on Hamel’s blog post A Field Guide to Rapidly Improving AI Products, which he joined Hugo’s class to discuss.

They cover:

🔍 Why most teams struggle to measure whether their systems are actually improving

📊 How error analysis helps you prioritize what to fix (and when to write evals)

🧮 Why evaluation isn’t just a metric — but a full development process

⚠️ Common mistakes when debugging LLM and agent systems

🛠️ How to think about the tradeoffs in adding more evals vs. fixing obvious issues

👥 Why enabling domain experts — not just engineers — can accelerate iteration

If you’ve ever built an AI system and found yourself unsure how to make it better, this conversation is for you.

LINKS

  • A Field Guide to Rapidly Improving AI Products by Hamel Husain
  • Vanishing Gradients YouTube Channel
  • Upcoming Events on Luma
  • Hugo's recent newsletter about upcoming events and more!
  • 🎓 Learn more:

    • Hugo's course: Building LLM Applications for Data Scientists and Software Engineers — next cohort starts July 8: https://maven.com/s/course/d56067f338
    • Hamel & Shreya's course: AI Evals For Engineers & PMs — use code GOHUGORGOHOME for $800 off
    • 📺 Watch the video version on YouTube: YouTube link

Vanishing Gradients by Hugo Bowne-Anderson
