July 17, 2025

Asimov: Building An Omniscient RL Oracle with ReflectionAI’s Misha Laskin

1 hour 2 minutes

Superintelligence, at least in an academic sense, has already been achieved. But Misha Laskin thinks that the next step towards artificial superintelligence, or ASI, should look both more user- and problem-focused. ReflectionAI co-founder and CEO Misha Laskin joins Sarah Guo to introduce Asimov, their new code comprehension agent built on reinforcement learning (RL). Misha talks about creating tools and designing AI agents based on customer needs, and how that influences eval development and the scope of the agent’s memory. The two also discuss the challenges in solving scaling for RL, the future of ASI, and the implications of Google’s “non-acquisition” of Windsurf.

Sign up for new podcasts every week. Email feedback to [email protected]

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @MishaLaskin | @reflection_ai

Chapters:

00:00 – Misha Laskin Introduction

00:44 – Superintelligence vs. Super Intelligent Autonomous Systems

03:26 – Misha’s Journey from Physics to AI

07:48 – Asimov Product Release

11:52 – What Differentiates Asimov from Other Agents

16:15 – Asimov’s Eval Philosophy

21:52 – The Types of Queries Where Asimov Shines

24:35 – Designing a Team-Wide Memory for Asimov

28:38 – Leveraging Pre-Trained Models

32:47 – The Challenges of Solving Scaling in RL

37:21 – Training Agents in Copycat Software Environments

38:25 – When Will We See ASI?

44:27 – Thoughts on Windsurf’s Non-Acquisition

48:10 – Exploring Non-RL Datasets

55:12 – Tackling Problems Beyond Engineering and Coding

57:54 – Where We’re At in Deploying ASI in Different Fields

01:02:30 – Conclusion

No Priors: Artificial Intelligence | Technology | Startups

By Conviction

4.4 (119 ratings)
