
Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more!
Pierluca D'Oro is a PhD student at Mila and a visiting researcher at Meta.
Martin Klissarov is a PhD student at Mila and McGill and a research scientist intern at Meta.  
Featured References  
Motif: Intrinsic Motivation from Artificial Intelligence Feedback  
Martin Klissarov*, Pierluca D'Oro*, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff  
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control  
Nate Rahn*, Pierluca D'Oro*, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare  
To keep doing RL research, stop calling yourself an RL researcher 
Pierluca D'Oro 
 By Robin Ranjit Singh Chauhan
