Machine Learning Street Talk (MLST)

When AI Discovers The Next Transformer - Robert Lange (Sakana)


Listen Later

Robert Lange, founding researcher at Sakana AI, joins Tim to discuss *Shinka Evolve* — a framework that combines LLMs with evolutionary algorithms to do open-ended program search. The core claim: systems like AlphaEvolve can optimize solutions to fixed problems, but real scientific progress requires co-evolving the problems themselves.


GTC is coming, the premier AI conference, great opportunity to learn about AI. NVIDIA and partners will showcase breakthroughs in physical AI, AI factories, agentic AI, and inference, exploring the next wave of AI innovation for developers and researchers. Register for virtual GTC for free, using my link and win NVIDIA DGX Spark (https://nvda.ws/4qQ0LMg)


• Why AlphaEvolve gets stuck — it needs a human to hand it the right problem. Shinka tries to invent new problems automatically, drawing on ideas from POET, PowerPlay, and MAP-Elites quality-diversity search.


• The *architecture* of Shinka: an archive of programs organized as islands, LLMs used as mutation operators, and a UCB bandit that adaptively selects between frontier models (GPT-5, Sonnet 4.5, Gemini) mid-run. The credit-assignment problem across models turns out to be genuinely hard.


• Concrete results — state-of-the-art circle packing with dramatically fewer evaluations, second place in an AtCoder competitive programming challenge, evolved load-balancing loss functions for mixture-of-experts models, and agent scaffolds for AIME math benchmarks.


• Are these systems actually thinking outside the box, or are they parasitic on their starting conditions? When LLMs run autonomously, "nothing interesting happens." Robert pushes back with the stepping-stone argument — evolution doesn't need to extrapolate, just recombine usefully.


• The AI Scientist question: can automated research pipelines produce real science, or just workshop-level slop that passes surface-level review? Robert is honest that the current version is more co-pilot than autonomous researcher.


• Where this lands in 5-20 years — Robert's prediction that scientific research will be fundamentally transformed, and Tim's thought experiment about alien mathematical artifacts that no human could have conceived.


Robert Lange: https://roberttlange.com/


---

TIMESTAMPS:

00:00:00 Introduction: Robert Lange, Sakana AI and Shinka Evolve

00:04:15 AlphaEvolve's Blind Spot: Co-Evolving Problems with Solutions

00:09:05 Unknown Unknowns, POET, and Auto-Curricula for AI Science

00:14:20 MAP-Elites and Quality-Diversity: Shinka's Evolutionary Architecture

00:28:00 UCB Bandits, Mutations and the Vibe Research Vision

00:40:00 Scaling Shinka: Meta-Evolution, Democratisation and the Three-Axis Model

00:47:10 Applications, ARC-AGI and the Future of Work

00:57:00 The AI Scientist and the Human Co-Pilot: Who Steers the Search?

01:06:00 AI Scientist v2, Slop Critique and the Future of Scientific Publishing


---

REFERENCES:

paper:

[00:03:30] ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

https://arxiv.org/abs/2509.19349

[00:04:15] AlphaEvolve: A Coding Agent for Scientific and Algorithmic Discovery

https://arxiv.org/abs/2506.13131

[00:06:30] Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

https://arxiv.org/abs/2505.22954

[00:09:05] Paired Open-Ended Trailblazer (POET)

https://arxiv.org/abs/1901.01753

[00:10:00] PowerPlay: Training an Increasingly General Problem Solver by Continually Searching for the Simplest Still Unsolvable Problem

https://arxiv.org/abs/1112.5309

[00:10:40] Automated Capability Discovery via Foundation Model Self-Exploration

https://arxiv.org/abs/2502.07577

[00:15:30] Illuminating Search Spaces by Mapping Elites (MAP-Elites)

https://arxiv.org/abs/1504.04909

[00:47:10] Automated Design of Agentic Systems (ADAS)

https://arxiv.org/abs/2408.08435


PDF : https://app.rescript.info/api/sessions/b8a9dcf60623657c/pdf/download

Transcript: https://app.rescript.info/public/share/SDOD_3oXOcli3zTqcAtR8eibT5U3gam84oo4KRtI-Vk

...more
View all episodesView all episodes
Download on the App Store

Machine Learning Street Talk (MLST)By Machine Learning Street Talk (MLST)

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

95 ratings


More shows like Machine Learning Street Talk (MLST)

View all
The a16z Show by Andreessen Horowitz

The a16z Show

1,105 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

441 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

305 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

345 Listeners

Practical AI by Practical AI LLC

Practical AI

209 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

205 Listeners

Last Week in AI by Skynet Today

Last Week in AI

314 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

552 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

513 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

148 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

102 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

229 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

688 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

476 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners