
In Ep 2 we ask: "Panic or Progress? Reading Between the Lines of AI Safety Tests." We unpack the recent Claude Opus 4 "blackmail" test result, OpenAI's new transparency pledge, and why safety evaluations sometimes sound scarier than they are. Listeners will leave with a clear framework for interpreting headline-grabbing safety reports—and practical advice on when to worry, when to wait, and how to separate red flags from red herrings.