January 28, 2025

ReflectionAI Founder Ioannis Antonoglou: From AlphaGo to AGI

52 minutes

Ioannis Antonoglou, founding engineer at DeepMind and co-founder of ReflectionAI, has seen the triumphs of reinforcement learning firsthand. From AlphaGo to AlphaZero and MuZero, Ioannis has built the most powerful agents in the world. He breaks down key moments in AlphaGo's match against Lee Sedol (Moves 37 and 78), the importance of self-play, and the impact of scale, reliability, planning and in-context learning as core factors that will unlock the next level of progress in AI.

Hosted by: Stephanie Zhan and Sonya Huang, Sequoia Capital

Mentioned in this episode:

PPO: Proximal Policy Optimization, an algorithm developed by OpenAI and tested in game environments. OpenAI also used it for RLHF in ChatGPT.

MuJoCo: Open source physics engine used to develop PPO

Monte Carlo Tree Search: Heuristic search algorithm used in AlphaGo as well as video compression for YouTube and the self-driving system at Tesla

AlphaZero: The DeepMind model that taught itself from scratch how to master the games of chess, shogi and Go

MuZero: The DeepMind follow-up to AlphaZero that mastered games without knowing the rules and is able to plan winning strategies in unknown environments

AlphaChem: Chemical Synthesis Planning with Tree Search and Deep Neural Network Policies

DQN: Deep Q-Network, introduced in the 2013 paper "Playing Atari with Deep Reinforcement Learning"

AlphaFold: DeepMind model for predicting protein structures, for which Demis Hassabis and John Jumper shared the 2024 Nobel Prize in Chemistry with David Baker, who was recognized for computational protein design

Training Data

By Sequoia Capital
