April 08, 2026

“Excerpts and Notes on Mythos Model Card” by williawa

28 minutes

List of Excerpts from Mythos model card. Tried to include interesting things, but also included some boring to be expected things. I omitted some things that were too long.

Also wanna note,

that this list of excerpts highlights "concerning" things above the rate at which they occur in the document.
I frequently say "Anthropic seems to think ..." or "their theory appears to be that ...", and this doesn't mean I think the opinion is unsubstantiated or that they are wrong, its just a natural way to phrase things for me.

Capability Stuff

Anthropic Staff Opinion About whether Mythos is a drop-in replacement for entry Research Eng/Scientist

We did an n=18 survey on Claude Mythos Preview's strengths and limitations. 1/18 participants thought we already had a drop-in replacement for an entry-level Research Scientist or Engineer, and 4 thought Claude Mythos Preview had a 50% chance of qualifying as such with 3 months of scaffolding iteration. We suspect those numbers would go down with a clarifying dialogue, as they did in the last model release, but we didn’t engage in such a dialogue this time.

Model hallucinates much less and also gets dramatically better [...]

---

Outline:

(00:43) Capability Stuff

(03:52) Cyber Capabilities

(05:08) Alignment

(21:20) White Box Evaluation of Model Internals

(23:52) Model Welfare

(24:36) Alignment Risk Update companion report

---

First published:

April 8th, 2026

Source:

https://www.lesswrong.com/posts/ZfbChZBXgje8T6Geu/excerpts-and-notes-on-mythos-model-card

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more

View all episodes

By LessWrong

April 08, 2026

“Excerpts and Notes on Mythos Model Card” by williawa

28 minutes

List of Excerpts from Mythos model card. Tried to include interesting things, but also included some boring to be expected things. I omitted some things that were too long.

Also wanna note,

that this list of excerpts highlights "concerning" things above the rate at which they occur in the document.
I frequently say "Anthropic seems to think ..." or "their theory appears to be that ...", and this doesn't mean I think the opinion is unsubstantiated or that they are wrong, its just a natural way to phrase things for me.

Capability Stuff

Anthropic Staff Opinion About whether Mythos is a drop-in replacement for entry Research Eng/Scientist

Model hallucinates much less and also gets dramatically better [...]

---

Outline:

(00:43) Capability Stuff

(03:52) Cyber Capabilities

(05:08) Alignment

(21:20) White Box Evaluation of Model Internals

(23:52) Model Welfare

(24:36) Alignment Risk Update companion report

---

First published:

April 8th, 2026

Source:

https://www.lesswrong.com/posts/ZfbChZBXgje8T6Geu/excerpts-and-notes-on-mythos-model-card

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more

More shows like LessWrong (30+ Karma)

View all

The Daily

112,347 Listeners

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat

7,244 Listeners

Dwarkesh Podcast

560 Listeners

The Ezra Klein Show

16,327 Listeners

AI Article Readings

4 Listeners

Doom Debates!

14 Listeners

LessWrong posts by zvi

2 Listeners

Share “Excerpts and Notes on Mythos Model Card” by williawa

Sign up to save your podcasts

“Excerpts and Notes on Mythos Model Card” by williawa

“Excerpts and Notes on Mythos Model Card” by williawa

More shows like LessWrong (30+ Karma)

The Daily

Astral Codex Ten Podcast

Interesting Times with Ross Douthat

Dwarkesh Podcast

The Ezra Klein Show

AI Article Readings

Doom Debates!

LessWrong posts by zvi