June 04, 2026

OpenAI's Dan Roberts: Why AI Can Now Make Discoveries

49 minutes

Are we witnessing the first real signs of AI becoming a scientist? In this episode of The MAD Podcast, Matt Turck sits down with Dan Roberts, lead of the Foundations of Reinforcement Learning team at OpenAI, to explore one of the biggest shifts happening in AI: the rise of reasoning models, test-time compute, and reinforcement learning as engines of scientific discovery. Dan brings a rare perspective - from theoretical physics, black holes, quantum information, and deep learning theory - to explain how models are learning to “think,” why language may be such a powerful foundation for intelligence, what recent AI math breakthroughs really mean, and whether we are beginning to see AI systems that can contribute to science itself.

(00:00) Intro: AI's wild week in mathematics

(01:21) What OpenAI's Foundations of RL team does

(03:08) Dan's journey: from black holes and quantum gravity to frontier AI

(07:04) Are AI systems becoming useful for real science?

(08:21) The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic

(08:52) Why the OpenAI result was an act of exploration

(10:25) OpenAI vs. DeepMind: informal reasoning vs. formal proof

(12:13) RL 101: learning by doing, not just watching

(15:10) Why reinforcement learning works

(15:58) How RL breaks: sparse feedback and long-horizon tasks

(17:03) RLHF: how human feedback shaped early language models

(18:48) Move 37, self-play, and the search for novel strategies

(22:16) Explore vs. exploit in scientific discovery

(24:49) Why RL may now be "the cake," not the cherry on top

(25:46) Why RL started working with large language models

(27:29) Is RL "sucking supervision through a straw"?

(28:47) Why language may be the grounding layer for intelligence

(31:46) A contrarian take on the Bitter Lesson

(32:41) What test-time compute actually is

(34:50) How RL gives models the ability to think

(35:40) Verifiable rewards, math, coding, and the messy real world

(38:00) What physics can teach us about AI

(42:08) Is there a thermodynamics of AI?

(43:08) From Erdős problems to Einstein-level AI

(45:16) Is AI already doing original science?

(45:51) How far are we from AI automating AI research?

(47:41) Why Dan is excited about the future of science

...more

View all episodes

By Matt Turck

2424 ratings

June 04, 2026

OpenAI's Dan Roberts: Why AI Can Now Make Discoveries

49 minutes

(00:00) Intro: AI's wild week in mathematics

(01:21) What OpenAI's Foundations of RL team does

(03:08) Dan's journey: from black holes and quantum gravity to frontier AI

(07:04) Are AI systems becoming useful for real science?

(08:21) The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic

(08:52) Why the OpenAI result was an act of exploration

(10:25) OpenAI vs. DeepMind: informal reasoning vs. formal proof

(12:13) RL 101: learning by doing, not just watching

(15:10) Why reinforcement learning works

(15:58) How RL breaks: sparse feedback and long-horizon tasks

(17:03) RLHF: how human feedback shaped early language models

(18:48) Move 37, self-play, and the search for novel strategies

(22:16) Explore vs. exploit in scientific discovery

(24:49) Why RL may now be "the cake," not the cherry on top

(25:46) Why RL started working with large language models

(27:29) Is RL "sucking supervision through a straw"?

(28:47) Why language may be the grounding layer for intelligence

(31:46) A contrarian take on the Bitter Lesson

(32:41) What test-time compute actually is

(34:50) How RL gives models the ability to think

(35:40) Verifiable rewards, math, coding, and the messy real world

(38:00) What physics can teach us about AI

(42:08) Is there a thermodynamics of AI?

(43:08) From Erdős problems to Einstein-level AI

(45:16) Is AI already doing original science?

(45:51) How far are we from AI automating AI research?

(47:41) Why Dan is excited about the future of science

...more

More shows like The MAD Podcast with Matt Turck

View all

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

536 Listeners

The a16z Show

1,105 Listeners

Invest Like the Best with Patrick O'Shaughnessy

2,342 Listeners

Azeem Azhar's Exponential View

616 Listeners

Y Combinator Startup Podcast

233 Listeners

All-In with Chamath, Jason, Sacks & Friedberg

10,254 Listeners

Machine Learning Street Talk (MLST)

101 Listeners

Dwarkesh Podcast

551 Listeners

Big Technology Podcast

512 Listeners

No Priors: Artificial Intelligence | Technology | Startups

150 Listeners

Latent Space: The AI Engineer Podcast

101 Listeners

AI + a16z

34 Listeners

Sharp Tech with Ben Thompson

97 Listeners

TBPN

140 Listeners

Uncapped with Jack Altman

42 Listeners

Share OpenAI's Dan Roberts: Why AI Can Now Make Discoveries

Sign up to save your podcasts

OpenAI's Dan Roberts: Why AI Can Now Make Discoveries

OpenAI's Dan Roberts: Why AI Can Now Make Discoveries

More shows like The MAD Podcast with Matt Turck

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

The a16z Show

Invest Like the Best with Patrick O'Shaughnessy

Azeem Azhar's Exponential View

Y Combinator Startup Podcast

All-In with Chamath, Jason, Sacks & Friedberg

Machine Learning Street Talk (MLST)

Dwarkesh Podcast

Big Technology Podcast

No Priors: Artificial Intelligence | Technology | Startups

Latent Space: The AI Engineer Podcast

AI + a16z

Sharp Tech with Ben Thompson

TBPN

Uncapped with Jack Altman