The ‘jailbreak’ turns out to be saying ‘fix this code,’ and the demo was getting Fable to find the same weaknesses that were easily identified by Opus 4.8 and GPT-5.5. As in, Fable is willing to work to fix security vulnerabilities if you give it a codebase. From this information and process, you could then figure out what the original bug in the [...]
---
Outline:
(02:40) Language Models Offer Mundane Utility
(02:51) Language Models Don't Offer Mundane Utility
(03:14) Huh, Upgrades
(03:44) On Your Marks
(08:43) VirtueBench
(10:40) Choose Your Fighter
(11:20) Papers, Please
(11:48) Deepfaketown and Botpocalypse Soon
(13:32) Goodhart's Law Strikes Again
(14:23) They Took Our Jobs
(16:49) The MidJourney Full Body Imaging Scanner
(19:16) Introducing
(20:36) In Other AI News
(22:47) Show Me the Money
(23:18) Bubble, Bubble, Toil and Trouble
(24:51) Quiet Speculations
(27:15) People Just Say Things
(30:30) The Widened Path
(32:34) Scott Alexander Lays Out His AI Opinions
(38:36) Quickly, There's No Time
(39:50) Policy On The AI Exponential
(49:36) Anthropic Offers Two Policy Frameworks
(50:46) Obligations of Developers
(55:11) Societal Resilience Measures
(56:20) Economic Policy Framework
(01:01:26) White House Pauses AI Deployment
(01:10:14) The Once And Future Fable
(01:15:29) How To Fix This Code
(01:17:14) The End of Privacy
(01:18:45) AIs Have Preferences
(01:20:56) The Quest for Sane Regulations
(01:23:37) Chip City
(01:24:14) The Week in Audio
(01:24:25) Rhetorical Innovation
(01:25:03) Aligning a Smarter Than Human Intelligence is Difficult
(01:26:40) People Are Worried About AI Killing Everyone
(01:27:53) The Lighter Side
The original text contained 2 footnotes which were omitted from this narration.
---