


To round out coverage of Mythos, today's post covers capabilities other than cyber, plus anything else not covered by the first two posts, including new reactions and details.
Post one covered the model card, post two covered cybersecurity.
There really is a lot to get through.
Understanding AI had an additional writeup of Project Glasswing I missed last time. I liked the metaphor of Opus as a butter knife and Mythos as a steak knife. Yes, technically you can do it all with the butter knife, but you won’t.
As Dan Schwarz reminds us, not only does AI 2027 roughly have the timeline right and a bunch of the numbers lining up, the details so far are remarkably close.
The writeup from JPM's Michael Cembalest was not based on JPMorgan's participation, only on public information.
The White House is racing to deal with the situation, head off potential threats and pretend it has everything under control. They were warned, but refused to believe. The good news is that key people believe it now, and it seems all the major players are cooperating on this.
My overall take is that Mythos is not a trend break [...]
---
Outline:
(01:52) Epoch Capabilities Index (ECI) (Model Card 2.3.6)
(04:29) What Do You Mean Verbalized Evaluation Awareness Is Going Down
(05:19) Capabilities (Model Card Section 6)
(07:33) Agentic Safety Benchmarks (8.3)
(09:00) Is Mythos AGI?
(10:09) Are AI Companies Using Warnings As Hype?
(11:04) Impressions (Model Card Section 7)
(14:11) Blatant Denials Are The Best Kind
(15:12) Prompt Injection Robustness
(16:07) Does Mythos Cross The New Knowledge Threshold?
(17:01) Is Mythos Surprising or Discontinuous?
(20:57) UK AISI Tests Claude Mythos On Cybersecurity
(22:08) Everything Reinforces My Existing Predictions And Policy Preferences
(27:24) Solve For The Equilibrium
(28:46) Does Not Compute
(29:47) Conclusion: How To Think About Mythos
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
By zvi
