
Sign up to save your podcasts
Or


This episode explores reinforcement learning and its relationship to MDPs. Also mentioned: exploration v. exploitation, multi-arm bandits, model-free learning, q-learning.
Disclosure: This episode was generated using NotebookLM by uploading Professor Chris Callison-Burch's lecture notes and slides.
By Chris Callison-BurchThis episode explores reinforcement learning and its relationship to MDPs. Also mentioned: exploration v. exploitation, multi-arm bandits, model-free learning, q-learning.
Disclosure: This episode was generated using NotebookLM by uploading Professor Chris Callison-Burch's lecture notes and slides.