Data Science at Home

What is wrong with reinforcement learning? (Ep. 82)


Listen Later

Join the discussion on our Discord server

 

After reinforcement learning agents doing great at playing Atari video games, Alpha Go, doing financial trading, dealing with language modeling, let me tell you the real story here.

In this episode I want to shine some light on reinforcement learning (RL) and the limitations that every practitioner should consider before taking certain directions. RL seems to work so well! What is wrong with it?

 

Are you a listener of Data Science at Home podcast?

A reader of the Amethix Blog? 
Or did you subscribe to the Artificial Intelligence at your fingertips newsletter?
In any case let’s stay in touch! 
https://amethix.com/survey/

 

 

References
  • Emergence of Locomotion Behaviours in Rich Environments 
https://arxiv.org/abs/1707.02286
  • Rainbow: Combining Improvements in Deep Reinforcement Learning 
  • https://arxiv.org/abs/1710.02298
  • AlphaGo Zero: Starting from scratch 
  • https://deepmind.com/blog/article/alphago-zero-starting-scratch
    ...more
    View all episodesView all episodes
    Download on the App Store

    Data Science at HomeBy Francesco Gadaleta

    • 4.2
    • 4.2
    • 4.2
    • 4.2
    • 4.2

    4.2

    72 ratings


    More shows like Data Science at Home

    View all
    More or Less by BBC Radio 4

    More or Less

    891 Listeners

    WSJ Tech News Briefing by The Wall Street Journal

    WSJ Tech News Briefing

    1,639 Listeners

    Software Engineering Daily by Software Engineering Daily

    Software Engineering Daily

    622 Listeners

    Talk Python To Me by Michael Kennedy

    Talk Python To Me

    585 Listeners

    BBC Inside Science by BBC Radio 4

    BBC Inside Science

    413 Listeners

    Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

    Super Data Science: ML & AI Podcast with Jon Krohn

    303 Listeners

    FT Tech Tonic by Financial Times

    FT Tech Tonic

    99 Listeners

    Worklife with Adam Grant by TED

    Worklife with Adam Grant

    9,159 Listeners

    Practical AI by Practical AI LLC

    Practical AI

    207 Listeners

    Last Week in AI by Skynet Today

    Last Week in AI

    306 Listeners

    Hard Fork by The New York Times

    Hard Fork

    5,509 Listeners

    This Day in AI Podcast by Michael Sharkey, Chris Sharkey

    This Day in AI Podcast

    227 Listeners

    The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

    The AI Daily Brief: Artificial Intelligence News and Analysis

    611 Listeners

    Unhedged by Financial Times & Pushkin Industries

    Unhedged

    181 Listeners

    The Last Invention by Longview

    The Last Invention

    1,086 Listeners