LessWrong posts by zvi

“Claude Mythos #3: Capabilities and Additions” by Zvi


Listen Later

To round out coverage of Mythos, today covers capabilities other than cyber, and anything else additional not covered by the first two posts, including new reactions and details.

Post one covered the model card, post two covered cybersecurity.

There really is a lot to get through.

Understanding AI had an additional writeup of Project Glasswing I missed last time. I liked the metaphor of Opus as a butter knife and Mythos as a steak knife. Yes, technically you can do it all with the butter knife, but you won’t.

As Dan Schwarz reminds us, not only does AI 2027 roughly have the timeline right and a bunch of the numbers lining up, the details so far are remarkably close.

JPM's Michael Cembalest was not based on JPMorgan's participation, only on public information.

The White House is racing to deal with the situation, head off potential threats and pretend it has everything under control. They were warned, but refused to believe. The good news is that key people believe it now, and it seems all the major players are cooperating on this.

My overall take is that Mythos is not a trend break [...]

---

Outline:

(01:52) Epoch Capabilities Index (ECI) (Model Card 2.3.6)

(04:29) What Do You Mean Verbalized Evaluation Awareness Is Going Down

(05:19) Capabilities (Model Card Section 6)

(07:33) Agentic Safety Benchmarks (8.3)

(09:00) Is Mythos AGI?

(10:09) Are AI Companies Using Warnings As Hype?

(11:04) Impressions (Model Card Section 7)

(14:11) Blatant Denials Are The Best Kind

(15:12) Prompt Injection Robustness

(16:07) Does Mythos Cross The New Knowledge Threshold?

(17:01) Is Mythos Surprising or Discontinuous?

(20:57) UK AISI Tests Claude Mythos On Cybersecurity

(22:08) Everything Reinforces My Existing Predictions And Policy Preferences

(27:24) Solve For The Equilibrium

(28:46) Does Not Compute

(29:47) Conclusion: How To Think About Mythos

---

First published:

April 14th, 2026

Source:

https://www.lesswrong.com/posts/2ziYGFK7QmbbLgBoP/claude-mythos-3-capabilities-and-additions

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more
View all episodesView all episodes
Download on the App Store

LessWrong posts by zviBy zvi

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like LessWrong posts by zvi

View all
Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,380 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,461 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,105 Listeners

Future of Life Institute Podcast by Future of Life Institute

Future of Life Institute Podcast

109 Listeners

ChinaTalk by Jordan Schneider

ChinaTalk

291 Listeners

Politix by Politix

Politix

90 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

551 Listeners

Hard Fork by The New York Times

Hard Fork

5,576 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

137 Listeners

LessWrong (Curated & Popular) by LessWrong

LessWrong (Curated & Popular)

13 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

150 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

147 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

475 Listeners

LessWrong (30+ Karma) by LessWrong

LessWrong (30+ Karma)

0 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

143 Listeners