March 23, 2021

#49 - Meta-Gradients in RL - Dr. Tom Zahavy (DeepMind)

Listen Later

1 hour 25 minutes

The race is on, we are on a collective mission to understand and create artificial general intelligence. Dr. Tom Zahavy, a Research Scientist at DeepMind thinks that reinforcement learning is the most general learning framework that we have today, and in his opinion it could lead to artificial general intelligence. He thinks there are no tasks which could not be solved by simply maximising a reward.

Back in 2012 when Tom was an undergraduate, before the deep learning revolution he attended an online lecture on how CNNs automatically discover representations. This was an epiphany for Tom. He decided in that very moment that he was going to become an ML researcher. Tom's view is that the ability to recognise patterns and discover structure is the most important aspect of intelligence. This has been his quest ever since. He is particularly focused on using diversity preservation and metagradients to discover this structure.

In this discussion we dive deep into meta gradients in reinforcement learning.

Video version and TOC @ https://www.youtube.com/watch?v=hfaZwgk_iS0

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Machine Learning Street Talk (MLST)

By Machine Learning Street Talk (MLST)

4.7

8585 ratings

March 23, 2021

#49 - Meta-Gradients in RL - Dr. Tom Zahavy (DeepMind)

Listen Later

1 hour 25 minutes

The race is on, we are on a collective mission to understand and create artificial general intelligence. Dr. Tom Zahavy, a Research Scientist at DeepMind thinks that reinforcement learning is the most general learning framework that we have today, and in his opinion it could lead to artificial general intelligence. He thinks there are no tasks which could not be solved by simply maximising a reward.

Back in 2012 when Tom was an undergraduate, before the deep learning revolution he attended an online lecture on how CNNs automatically discover representations. This was an epiphany for Tom. He decided in that very moment that he was going to become an ML researcher. Tom's view is that the ability to recognise patterns and discover structure is the most important aspect of intelligence. This has been his quest ever since. He is particularly focused on using diversity preservation and metagradients to discover this structure.

In this discussion we dive deep into meta gradients in reinforcement learning.

Video version and TOC @ https://www.youtube.com/watch?v=hfaZwgk_iS0

...more

More shows like Machine Learning Street Talk (MLST)

Data Skeptic by Kyle Polich

Data Skeptic

478 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

432 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

302 Listeners

Practical AI by Practical AI LLC

Practical AI

212 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

196 Listeners

Last Week in AI by Skynet Today

Last Week in AI

305 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

70 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

131 Listeners

Unsupervised Learning by by Redpoint Ventures

Unsupervised Learning

49 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

95 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

209 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

585 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners

Lightcone Podcast by Y Combinator

Lightcone Podcast

22 Listeners

Training Data by Sequoia Capital

Training Data

39 Listeners