October 15, 2019

What is wrong with reinforcement learning? (Ep. 82)

21 minutes

Join the discussion on our Discord server

After all the stories of reinforcement learning agents doing great at playing Atari video games, beating humans at Go with AlphaGo, trading financial assets, and dealing with language modeling, let me tell you the real story.

In this episode I want to shine some light on reinforcement learning (RL) and the limitations that every practitioner should consider before taking certain directions. RL seems to work so well! What is wrong with it?

Are you a listener of the Data Science at Home podcast?

A reader of the Amethix Blog?

Or did you subscribe to the Artificial Intelligence at your fingertips newsletter?

In any case, let’s stay in touch!

https://amethix.com/survey/

References

Emergence of Locomotion Behaviours in Rich Environments

https://arxiv.org/abs/1707.02286

Rainbow: Combining Improvements in Deep Reinforcement Learning

https://arxiv.org/abs/1710.02298

AlphaGo Zero: Starting from scratch

https://deepmind.com/blog/article/alphago-zero-starting-scratch

Data Science at Home

By Francesco Gadaleta
