
Sign up to save your podcasts
Or
In this episode, Tom gives us a lesson on all things feedback, mostly where our scientific framings of it came from.
Together, we link this to RLHF, our previous work in RL, and how we were thinking about agentic ML systems before it was cool.
Join us, on another great blast from the past on The Retort!
We also have brought you video this week!
4.7
99 ratings
In this episode, Tom gives us a lesson on all things feedback, mostly where our scientific framings of it came from.
Together, we link this to RLHF, our previous work in RL, and how we were thinking about agentic ML systems before it was cool.
Join us, on another great blast from the past on The Retort!
We also have brought you video this week!
1,271 Listeners
999 Listeners
513 Listeners
1,786 Listeners
431 Listeners
611 Listeners
340 Listeners
270 Listeners
279 Listeners
8,677 Listeners
348 Listeners
395 Listeners
5,338 Listeners
421 Listeners
441 Listeners