The Information Bottleneck

EP13: Recurrent-Depth Models and Latent Reasoning with Jonas Geiping


Listen Later

In this episode, we host Jonas Geiping from ELLIS Institute & Max-Planck Institute for Intelligent Systems, Tübingen AI Center, Germany. We talked about his broad research on Recurrent-Depth Models and latent reasoning in large language models (LLMs). We talked about what these models can and can't do, what are the challenges and next breakthroughs in the field, world models, and the future of developing better models. We also talked about safety and interpretability, and the role of scaling laws in AI development.

Chapters

00:00 Introduction and Guest Introduction

01:03 Peer Review in Preprint Servers

06:57 New Developments in Coding Models

09:34 Open Source Models in Europe

11:00 Dynamic Layers in LLMs

26:05 Training Playbook Insights

30:05 Recurrent Depth Models and Reasoning Tasks

43:59 Exploring Recursive Reasoning Models

46:46 The Role of World Models in AI

48:41 Innovations in AI Training and Simulation

50:39 The Promise of Recurrent Depth Models

52:34 Navigating the Future of AI Algorithms

54:44 The Bitter Lesson of AI Development

59:11 Advising the Next Generation of Researchers

01:06:42 Safety and Interpretability in AI Models

01:10:46 Scaling Laws and Their Implications

01:16:19 The Role of PhDs in AI Research

Links and paper:

  • Jonas' website - https://jonasgeiping.github.io/
  • Scaling up test-time compute with latent reasoning: A recurrent depth approach - https://arxiv.org/abs/2502.05171
  • The Smol Training Playbook: The Secrets to Building World-Class LLMs - https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook
  • VaultGemma: A Differentially Private Gemma Model - https://arxiv.org/abs/2510.15001

Music:

“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.

“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.

Changes: trimmed

...more
View all episodesView all episodes
Download on the App Store

The Information BottleneckBy Ravid Shwartz-Ziv & Allen Roush

  • 5
  • 5
  • 5
  • 5
  • 5

5

4 ratings


More shows like The Information Bottleneck

View all
The New Yorker Radio Hour by WNYC Studios and The New Yorker

The New Yorker Radio Hour

6,776 Listeners

Fareed Zakaria GPS by CNN Podcasts

Fareed Zakaria GPS

3,398 Listeners

Macro Voices by Hedge Fund Manager Erik Townsend

Macro Voices

3,073 Listeners

Odd Lots by Bloomberg

Odd Lots

1,978 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,096 Listeners

Practical AI by Practical AI LLC

Practical AI

213 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,226 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

198 Listeners

Last Week in AI by Skynet Today

Last Week in AI

318 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

97 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

561 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

507 Listeners

Moonshots with Peter Diamandis by PHD Ventures

Moonshots with Peter Diamandis

595 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

145 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

147 Listeners