
Sign up to save your podcasts
Or


Are we witnessing the first real signs of AI becoming a scientist? In this episode of The MAD Podcast, Matt Turck sits down with Dan Roberts, lead of the Foundations of Reinforcement Learning team at OpenAI, to explore one of the biggest shifts happening in AI: the rise of reasoning models, test-time compute, and reinforcement learning as engines of scientific discovery. Dan brings a rare perspective - from theoretical physics, black holes, quantum information, and deep learning theory - to explain how models are learning to “think,” why language may be such a powerful foundation for intelligence, what recent AI math breakthroughs really mean, and whether we are beginning to see AI systems that can contribute to science itself.
(00:00) Intro: AI's wild week in mathematics
(01:21) What OpenAI's Foundations of RL team does
(03:08) Dan's journey: from black holes and quantum gravity to frontier AI
(07:04) Are AI systems becoming useful for real science?
(08:21) The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic
(08:52) Why the OpenAI result was an act of exploration
(10:25) OpenAI vs. DeepMind: informal reasoning vs. formal proof
(12:13) RL 101: learning by doing, not just watching
(15:10) Why reinforcement learning works
(15:58) How RL breaks: sparse feedback and long-horizon tasks
(17:03) RLHF: how human feedback shaped early language models
(18:48) Move 37, self-play, and the search for novel strategies
(22:16) Explore vs. exploit in scientific discovery
(24:49) Why RL may now be "the cake," not the cherry on top
(25:46) Why RL started working with large language models
(27:29) Is RL "sucking supervision through a straw"?
(28:47) Why language may be the grounding layer for intelligence
(31:46) A contrarian take on the Bitter Lesson
(32:41) What test-time compute actually is
(34:50) How RL gives models the ability to think
(35:40) Verifiable rewards, math, coding, and the messy real world
(38:00) What physics can teach us about AI
(42:08) Is there a thermodynamics of AI?
(43:08) From Erdős problems to Einstein-level AI
(45:16) Is AI already doing original science?
(45:51) How far are we from AI automating AI research?
(47:41) Why Dan is excited about the future of science
By Matt Turck5
2424 ratings
Are we witnessing the first real signs of AI becoming a scientist? In this episode of The MAD Podcast, Matt Turck sits down with Dan Roberts, lead of the Foundations of Reinforcement Learning team at OpenAI, to explore one of the biggest shifts happening in AI: the rise of reasoning models, test-time compute, and reinforcement learning as engines of scientific discovery. Dan brings a rare perspective - from theoretical physics, black holes, quantum information, and deep learning theory - to explain how models are learning to “think,” why language may be such a powerful foundation for intelligence, what recent AI math breakthroughs really mean, and whether we are beginning to see AI systems that can contribute to science itself.
(00:00) Intro: AI's wild week in mathematics
(01:21) What OpenAI's Foundations of RL team does
(03:08) Dan's journey: from black holes and quantum gravity to frontier AI
(07:04) Are AI systems becoming useful for real science?
(08:21) The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic
(08:52) Why the OpenAI result was an act of exploration
(10:25) OpenAI vs. DeepMind: informal reasoning vs. formal proof
(12:13) RL 101: learning by doing, not just watching
(15:10) Why reinforcement learning works
(15:58) How RL breaks: sparse feedback and long-horizon tasks
(17:03) RLHF: how human feedback shaped early language models
(18:48) Move 37, self-play, and the search for novel strategies
(22:16) Explore vs. exploit in scientific discovery
(24:49) Why RL may now be "the cake," not the cherry on top
(25:46) Why RL started working with large language models
(27:29) Is RL "sucking supervision through a straw"?
(28:47) Why language may be the grounding layer for intelligence
(31:46) A contrarian take on the Bitter Lesson
(32:41) What test-time compute actually is
(34:50) How RL gives models the ability to think
(35:40) Verifiable rewards, math, coding, and the messy real world
(38:00) What physics can teach us about AI
(42:08) Is there a thermodynamics of AI?
(43:08) From Erdős problems to Einstein-level AI
(45:16) Is AI already doing original science?
(45:51) How far are we from AI automating AI research?
(47:41) Why Dan is excited about the future of science

536 Listeners

1,105 Listeners

2,342 Listeners

616 Listeners

233 Listeners

10,254 Listeners

101 Listeners

551 Listeners

512 Listeners

150 Listeners

101 Listeners

34 Listeners

97 Listeners

140 Listeners

42 Listeners