
Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI.
In our conversation with Jason, we explore Facebook's reinforcement learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions the team is using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace.
Jason also walks us through the differences between online and offline training and between on-policy and off-policy training, and where Re-Agent sits on this spectrum. Finally, we discuss the concept of counterfactual causality and how the team ensures the safety of their models' results.
The complete show notes for this episode can be found at twimlai.com/go/448.