
Original post:
https://www.interconnects.ai/p/openais-reinforcement-finetuning
Chapters
00:00 Introduction
04:19 The impact of reinforcement finetuning’s existence
07:29 Hypotheses on reinforcement finetuning’s implementation
Figures
Fig. 1, Yann’s Cake
Fig. 2, Grader config
Fig. 3, RLVR learning curves