Share TalkRL: The Reinforcement Learning Podcast
Share to email
Share to Facebook
Share to X
By Robin Ranjit Singh Chauhan
4.9
2525 ratings
The podcast currently has 61 episodes available.
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Finale Doshi-Velez is a Professor at the Harvard Paulson School of Engineering and Applied Sciences.
This off-the-cuff interview was recorded at UMass Amherst during the workshop day of RL Conference on August 9th 2024.
Host notes: I've been a fan of some of Prof Doshi-Velez' past work on clinical RL and hoped to feature her for some time now, so I jumped at the chance to get a few minutes of her thoughts -- even though you can tell I was not prepared and a bit flustered tbh. Thanks to Prof Doshi-Velez for taking a moment for this, and I hope to cross paths in future for a more in depth interview.
References
Thanks to Professor Silver for permission to record this discussion after his RLC 2024 keynote lecture.
Recorded at UMass Amherst during RCL 2024.
Due to the live recording environment, audio quality varies. We publish this audio in its raw form to preserve the authenticity and immediacy of the discussion.
References
David Silver is a principal research scientist at DeepMind and a professor at University College London.
This interview was recorded at UMass Amherst during RLC 2024.
References
Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta, and an author of TorchRL and TensorDict in pytorch.
Featured References
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens
Additional References
Arash Ahmadian is a Researcher at Cohere and Cohere For AI focussed on Preference Training of large language models. He’s also a researcher at the Vector Institute of AI.
Featured Reference
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker
Additional References
The podcast currently has 61 episodes available.
958 Listeners
471 Listeners
2,295 Listeners
444 Listeners
291 Listeners
288 Listeners
188 Listeners
182 Listeners
199 Listeners
7,176 Listeners
84 Listeners
207 Listeners
50 Listeners
57 Listeners
331 Listeners