Gradient Dissent: Conversations on AI

Tim & Heinrich — Democraticizing Reinforcement Learning Research


Listen Later

Since reinforcement learning requires hefty compute resources, it can be tough to keep up without a serious budget of your own. Find out how the team at Facebook AI Research (FAIR) is looking to increase access and level the playing field with the help of NetHack, an archaic rogue-like video game from the late 80s.
Links discussed:
The NetHack Learning Environment:
https://ai.facebook.com/blog/nethack-learning-environment-to-advance-deep-reinforcement-learning/
Reinforcement learning, intrinsic motivation:
https://arxiv.org/abs/2002.12292
Knowledge transfer:
https://arxiv.org/abs/1910.08210
Tim Rocktäschel is a Research Scientist at Facebook AI Research (FAIR) London and a Lecturer in the Department of Computer Science at University College London (UCL). At UCL, he is a member of the UCL Centre for Artificial Intelligence and the UCL Natural Language Processing group. Prior to that, he was a Postdoctoral Researcher in the Whiteson Research Lab, a Stipendiary Lecturer in Computer Science at Hertford College, and a Junior Research Fellow in Computer Science at Jesus College, at the University of Oxford.
https://twitter.com/_rockt
Heinrich Kuttler is an AI and machine learning researcher at Facebook AI Research (FAIR) and before that was a research engineer and team lead at DeepMind.
https://twitter.com/HeinrichKuttler
https://www.linkedin.com/in/heinrich-kuttler/
Topics covered:
0:00 a lack of reproducibility in RL
1:05 What is NetHack and how did the idea come to be?
5:46 RL in Go vs NetHack
11:04 performance of vanilla agents, what do you optimize for
18:36 transferring domain knowledge, source diving
22:27 human vs machines intrinsic learning
28:19 ICLR paper - exploration and RL strategies
35:48 the future of reinforcement learning
43:18 going from supervised to reinforcement learning
45:07 reproducibility in RL
50:05 most underrated aspect of ML, biggest challenges?
Get our podcast on these other platforms:
Apple Podcasts: http://wandb.me/apple-podcasts
Spotify: http://wandb.me/spotify
Google: http://wandb.me/google-podcasts
YouTube: http://wandb.me/youtube
Soundcloud: http://wandb.me/soundcloud
Tune in to our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research:
http://wandb.me/salon
Join our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning:
http://wandb.me/slack
Our gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices:
https://wandb.ai/gallery
...more
View all episodesView all episodes
Download on the App Store

Gradient Dissent: Conversations on AIBy Lukas Biewald

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

66 ratings


More shows like Gradient Dissent: Conversations on AI

View all
a16z Podcast by Andreessen Horowitz

a16z Podcast

998 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

441 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

295 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

324 Listeners

Practical AI by Practical AI LLC

Practical AI

189 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

203 Listeners

Last Week in AI by Skynet Today

Last Week in AI

281 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

89 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

356 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

125 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

196 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

64 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

420 Listeners

AI + a16z by a16z

AI + a16z

32 Listeners

Training Data by Sequoia Capital

Training Data

37 Listeners