The Information Bottleneck

EP13: Recurrent-Depth Models and Latent Reasoning with Jonas Geiping


In this episode, we host Jonas Geiping of the ELLIS Institute and the Max Planck Institute for Intelligent Systems, Tübingen AI Center, Germany. We discuss his research on recurrent-depth models and latent reasoning in large language models (LLMs): what these models can and can't do, the open challenges and likely next breakthroughs in the field, world models, and the future of developing better models. We also cover safety and interpretability, and the role of scaling laws in AI development.

Chapters

00:00 Introduction and Guest Introduction

01:03 Peer Review in Preprint Servers

06:57 New Developments in Coding Models

09:34 Open Source Models in Europe

11:00 Dynamic Layers in LLMs

26:05 Training Playbook Insights

30:05 Recurrent Depth Models and Reasoning Tasks

43:59 Exploring Recursive Reasoning Models

46:46 The Role of World Models in AI

48:41 Innovations in AI Training and Simulation

50:39 The Promise of Recurrent Depth Models

52:34 Navigating the Future of AI Algorithms

54:44 The Bitter Lesson of AI Development

59:11 Advising the Next Generation of Researchers

01:06:42 Safety and Interpretability in AI Models

01:10:46 Scaling Laws and Their Implications

01:16:19 The Role of PhDs in AI Research

Links and papers:

  • Jonas' website - https://jonasgeiping.github.io/
  • Scaling up test-time compute with latent reasoning: A recurrent depth approach - https://arxiv.org/abs/2502.05171
  • The Smol Training Playbook: The Secrets to Building World-Class LLMs - https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook
  • VaultGemma: A Differentially Private Gemma Model - https://arxiv.org/abs/2510.15001

Music:

“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.

“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.

The Information Bottleneck, by Ravid Shwartz-Ziv & Allen Roush
