AI Deep Dive

146: AI Hacks Code But Fails Pie Charts



Dive into the latest seismic shifts in the AI landscape. We unpack the leak and subsequent reveal of Anthropic's Claude Mythos Preview, a model so powerful that it is restricted to the Project Glasswing defensive cybersecurity coalition rather than released to the public. Meanwhile, the open-source community strikes back with Z AI's GLM-5.1, an agentic model built for marathon 8-hour autonomous coding sessions that just dethroned the top proprietary models on SWE-Bench Pro. We also cover Anthropic's staggering $30 billion run rate, its massive 3.5GW compute expansion with Google and Broadcom, the growing consumer demand for "anti-AI" marketing transparency, and the alarming reality that researchers are rapidly running out of benchmarks capable of accurately measuring these frontier systems.


AI Deep Dive, by Pete Larkin