
Arash Ahmadian is a researcher at Cohere and Cohere For AI, focused on preference training of large language models. He’s also a researcher at the Vector Institute of AI.
Featured Reference
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker
Additional References
 By Robin Ranjit Singh Chauhan
