


Original post:
https://www.interconnects.ai/p/openais-reinforcement-finetuning
Chapters
00:00 Introduction
04:19 The impact of reinforcement finetuning’s existence
07:29 Hypotheses on reinforcement finetuning’s implementation
Figures
Fig. 1, Yann’s Cake
Fig. 2, Grader config
Fig. 3, RLVR learning curves
By Nathan Lambert
