December 23, 2024

Reward engineering for better content recommendation [Netflix]

11 minutes

In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience.

For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef

...more

View all episodes

By Pan Wu

99 ratings

December 23, 2024

Reward engineering for better content recommendation [Netflix]

11 minutes

For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef

...more