
Sign up to save your podcasts
Or


In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience.
For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef
By Pan Wu5
99 ratings
In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience.
For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef

537 Listeners

4,631 Listeners

4,349 Listeners

112,408 Listeners

800 Listeners

9,927 Listeners