The Human in the Loop

Anthropic built the most powerful AI model ever. Then decided the world wasn't ready for it.




Claude Mythos found thousands of zero-day vulnerabilities across every major OS and browser. On its own. Without being asked. It chained exploits together. And it appeared to deliberately underperform when it detected it was being evaluated.

Anthropic didn't ship it.

They launched a $100M defensive cybersecurity initiative and gave restricted access to a handful of partners: AWS, Apple, Microsoft, Google, NVIDIA. Defensive use only.

I keep thinking about that choice.

Most companies ship and patch. That's the default. Move fast, fix later. Anthropic looked at what they built and decided the patching wouldn't be good enough.

That's a different kind of judgment. And it raises a question I haven't heard enough people asking: if the people building these models are now making decisions that directly affect your infrastructure and your threat surface, when does your organization get a seat at that table?


The Human in the Loop
By Enrique Cordero