


Take our survey at twimlai.com/survey21!
Today we’re joined by Tim Rocktäschel, a research scientist at Facebook AI Research and an associate professor at University College London (UCL).
Tim’s work focuses on training RL agents in simulated environments, with the goal of having these agents generalize to novel situations. Typically, this is done in environments like OpenAI Gym, MuJoCo, or Atari games, but these all come with constraints. Tim instead uses a game called NetHack, which is much richer and more complex than the aforementioned environments.
In our conversation with Tim, we explore the ins and outs of using NetHack as a training environment, including how much control a user has when generating each individual game and the challenges he's faced when deploying the agents. We also discuss his work on MiniHack, an environment creation framework and suite of tasks that are based on NetHack, and future directions for this research.
The complete show notes for this episode can be found at twimlai.com/go/527.
By Sam Charrington
