Data Science at Home

What is wrong with reinforcement learning? (Ep. 82)


Listen Later

Join the discussion on our Discord server

 

After reinforcement learning agents doing great at playing Atari video games, Alpha Go, doing financial trading, dealing with language modeling, let me tell you the real story here.

In this episode I want to shine some light on reinforcement learning (RL) and the limitations that every practitioner should consider before taking certain directions. RL seems to work so well! What is wrong with it?

 

Are you a listener of Data Science at Home podcast?

A reader of the Amethix Blog? 
Or did you subscribe to the Artificial Intelligence at your fingertips newsletter?
In any case let’s stay in touch! 
https://amethix.com/survey/

 

 

References
  • Emergence of Locomotion Behaviours in Rich Environments 
https://arxiv.org/abs/1707.02286
  • Rainbow: Combining Improvements in Deep Reinforcement Learning 
  • https://arxiv.org/abs/1710.02298
  • AlphaGo Zero: Starting from scratch 
  • https://deepmind.com/blog/article/alphago-zero-starting-scratch
    ...more
    View all episodesView all episodes
    Download on the App Store

    Data Science at HomeBy Francesco Gadaleta

    • 4.2
    • 4.2
    • 4.2
    • 4.2
    • 4.2

    4.2

    72 ratings


    More shows like Data Science at Home

    View all
    Radiolab by WNYC Studios

    Radiolab

    43,843 Listeners

    TED Talks Daily by TED

    TED Talks Daily

    11,267 Listeners

    Learning English Conversations by BBC Radio

    Learning English Conversations

    1,063 Listeners

    Stuff You Should Know by iHeartPodcasts

    Stuff You Should Know

    77,233 Listeners

    Data Skeptic by Kyle Polich

    Data Skeptic

    474 Listeners

    Talk Python To Me by Michael Kennedy

    Talk Python To Me

    584 Listeners

    AWS Podcast by Amazon Web Services

    AWS Podcast

    200 Listeners

    Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

    Super Data Science: ML & AI Podcast with Jon Krohn

    295 Listeners

    Learning English from the News by BBC Radio

    Learning English from the News

    249 Listeners

    DataFramed by DataCamp

    DataFramed

    267 Listeners

    Practical AI by Practical AI LLC

    Practical AI

    196 Listeners

    The Intelligence from The Economist by The Economist

    The Intelligence from The Economist

    2,537 Listeners

    Raport o stanie świata Dariusza Rosiaka by Dariusz Rosiak

    Raport o stanie świata Dariusza Rosiaka

    42 Listeners

    The Ancients by History Hit

    The Ancients

    2,820 Listeners

    Hard Fork by The New York Times

    Hard Fork

    5,367 Listeners