
Sign up to save your podcasts
Or


Ian Osband is a Research scientist at OpenAI (ex DeepMind, Stanford) working on decision making under uncertainty.
We spoke about:
- Information theory and RL
- Exploration, epistemic uncertainty and joint predictions
- Epistemic Neural Networks and scaling to LLMs
Featured References  
Reinforcement Learning, Bit by Bit  
Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen  
From Predictions to Decisions: The Importance of Joint Predictive Distributions
Zheng Wen, Ian Osband, Chao Qin, Xiuyuan Lu, Morteza Ibrahimi, Vikranth Dwaracherla, Mohammad Asghari, Benjamin Van Roy
Epistemic Neural Networks
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
Approximate Thompson Sampling via Epistemic Neural Networks 
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
Additional References  
 By Robin Ranjit Singh Chauhan
By Robin Ranjit Singh Chauhan4.9
2929 ratings
Ian Osband is a Research scientist at OpenAI (ex DeepMind, Stanford) working on decision making under uncertainty.
We spoke about:
- Information theory and RL
- Exploration, epistemic uncertainty and joint predictions
- Epistemic Neural Networks and scaling to LLMs
Featured References  
Reinforcement Learning, Bit by Bit  
Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen  
From Predictions to Decisions: The Importance of Joint Predictive Distributions
Zheng Wen, Ian Osband, Chao Qin, Xiuyuan Lu, Morteza Ibrahimi, Vikranth Dwaracherla, Mohammad Asghari, Benjamin Van Roy
Epistemic Neural Networks
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
Approximate Thompson Sampling via Epistemic Neural Networks 
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
Additional References  

30,635 Listeners

2,424 Listeners

1,084 Listeners

433 Listeners

302 Listeners

210 Listeners

198 Listeners

9,810 Listeners

90 Listeners

491 Listeners

208 Listeners

562 Listeners

497 Listeners

40 Listeners

52 Listeners