


List of excerpts from the Mythos model card. I tried to include interesting things, but also included some boring, to-be-expected things. I omitted some things that were too long.
Also wanna note,
Capability Stuff
Anthropic Staff Opinion About Whether Mythos Is a Drop-in Replacement for an Entry-Level Research Engineer/Scientist
We did an n=18 survey on Claude Mythos Preview's strengths and limitations. 1/18 participants thought we already had a drop-in replacement for an entry-level Research Scientist or Engineer, and 4 thought Claude Mythos Preview had a 50% chance of qualifying as such with 3 months of scaffolding iteration. We suspect those numbers would go down with a clarifying dialogue, as they did in the last model release, but we didn’t engage in such a dialogue this time.
Model hallucinates much less and also gets dramatically better [...]
---
Outline:
(00:43) Capability Stuff
(03:52) Cyber Capabilities
(05:08) Alignment
(21:20) White Box Evaluation of Model Internals
(23:52) Model Welfare
(24:36) Alignment Risk Update companion report
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
By LessWrong
