Latent Space: The AI Engineer Podcast

[AIEWF Preview] Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect


Listen Later

In an otherwise heavy week packed with Microsoft Build, Google I/O, and OpenAI io, the worst kept secret in biglab land was the launch of Claude 4, particularly the triumphant return of Opus, which many had been clamoring for. We will leave the specific Claude 4 recap to AINews, however we think that both Gemini’s progress on Deep Think this week and Claude 4 represent the next frontier of progress on inference time compute/reasoning (at last until GPT5 ships this summer).

Will Brown’s talk at AIE NYC and open source work on verifiers have made him one of the most prominent voices able to publicly discuss (aka without the vaguepoasting LoRA they put on you when you join a biglab) the current state of the art in reasoning models and where current SOTA research directions lead. We discussed his latest paper on Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment and he has previewed his AIEWF talk on Agentic RL for those with the temerity to power thru bad meetup audio.

Chapters
  • 00:00 Introduction and Episode Overview
  • 02:01 Discussion on Cloud 4 and its Features
  • 04:31 Reasoning and Tool Use in AI Models
  • 07:01 Extended Thinking in Claude and Model Differences
  • 09:31 Speculation on Claude's Extended Thinking
  • 11:01 Challenges and Controversies in AI Model Training
  • 13:31 Technical Highlights and Code Trustworthiness
  • 16:01 Token Costs and Incentives in AI Models
  • 18:31 Thinking Budgets and AI Effort
  • 21:01 Safety and Ethics in AI Model Development
  • 23:31 Anthropic's Approach to AI Safety
  • 26:01 LLM Arena and Evaluation Challenges
  • 28:31 Developing Taste and Direction in AI Research
  • 31:01 Recent Research and Multi-Turn RL
  • 33:31 Tools and Incentives in AI Model Development
  • 36:01 Challenges in Evaluating AI Model Outputs
  • 38:31 Model-Based Rewards and Future Directions
  • 41:01 Wrap-up and Future Plans
...more
View all episodesView all episodes
Download on the App Store

Latent Space: The AI Engineer PodcastBy swyx + Alessio

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

66 ratings


More shows like Latent Space: The AI Engineer Podcast

View all
a16z Podcast by Andreessen Horowitz

a16z Podcast

1,012 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

441 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

295 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

325 Listeners

Practical AI by Practical AI LLC

Practical AI

189 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

200 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

88 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

355 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

123 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

197 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

443 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

445 Listeners

AI + a16z by a16z

AI + a16z

30 Listeners

Lightcone Podcast by Y Combinator

Lightcone Podcast

21 Listeners

Training Data by Sequoia Capital

Training Data

36 Listeners