
Sign up to save your podcasts
Or


Hear about why OpenAI cites her work in RLHF and dialog models, approaches to rewards in RLHF, ChatGPT, Industry vs Academia, PsiPhi-Learning, AGI and more!
Dr Natasha Jaques is a Senior Research Scientist at Google Brain.
Featured References
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog 
Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard  
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control 
Natasha Jaques, Shixiang Gu, Dzmitry Bahdanau, José Miguel Hernández-Lobato, Richard E. Turner, Douglas Eck  
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning 
Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar  
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience 
Marwa Abdulhai, Natasha Jaques, Sergey Levine  
Additional References  
 By Robin Ranjit Singh Chauhan
By Robin Ranjit Singh Chauhan4.9
2929 ratings
Hear about why OpenAI cites her work in RLHF and dialog models, approaches to rewards in RLHF, ChatGPT, Industry vs Academia, PsiPhi-Learning, AGI and more!
Dr Natasha Jaques is a Senior Research Scientist at Google Brain.
Featured References
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog 
Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard  
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control 
Natasha Jaques, Shixiang Gu, Dzmitry Bahdanau, José Miguel Hernández-Lobato, Richard E. Turner, Douglas Eck  
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning 
Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar  
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience 
Marwa Abdulhai, Natasha Jaques, Sergey Levine  
Additional References  

30,638 Listeners

2,423 Listeners

1,082 Listeners

433 Listeners

302 Listeners

210 Listeners

198 Listeners

9,810 Listeners

89 Listeners

491 Listeners

208 Listeners

560 Listeners

499 Listeners

41 Listeners

52 Listeners