
Sign up to save your podcasts
Or


This is a brief research note describing the results of running @Jozdien's research code for the paper "Reasoning Models Sometimes Output Illegible Chains of Thought" using the Novita provider on OpenRouter.
tl;dr:
background
In this comment, I wrote (emphasis added):
I'm somewhat skeptical of that paper's interpretation of the observations it reports, at least for R1 and R1-Zero.
I have used [...]
---
Outline:
(01:21) background
(04:09) the setup
(04:13) review of the original codebase
(05:21) what i did
(06:17) results
(09:25) breaking down correctness
The original text contained 4 footnotes which were omitted from this narration.
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
By LessWrongThis is a brief research note describing the results of running @Jozdien's research code for the paper "Reasoning Models Sometimes Output Illegible Chains of Thought" using the Novita provider on OpenRouter.
tl;dr:
background
In this comment, I wrote (emphasis added):
I'm somewhat skeptical of that paper's interpretation of the observations it reports, at least for R1 and R1-Zero.
I have used [...]
---
Outline:
(01:21) background
(04:09) the setup
(04:13) review of the original codebase
(05:21) what i did
(06:17) results
(09:25) breaking down correctness
The original text contained 4 footnotes which were omitted from this narration.
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

112,347 Listeners

130 Listeners

7,244 Listeners

560 Listeners

16,327 Listeners

4 Listeners

14 Listeners

2 Listeners