


Only six weeks after Opus 4.7, we have Opus 4.8.
For everyone, that means another incremental upgrade to Claude. It is once again smarter, can do tasks for longer, and comes with a number of hot new features.
For me, that also means reading another 244-page system card.
It was only April 20 when I did a full review of the Opus 4.7 system card, plus an additional post focusing on related issues of model welfare.
These updates are incremental and coming more rapidly, and this is still below the capability level of Claude Mythos, so the focus will be on the delta. What is different about Opus 4.8 versus what we already know about Opus 4.7 and Mythos?
It turns out there's still a lot to talk about.
Table of Contents
---
Outline:
(01:16) Here We Go Again: Executive Summary
(02:33) Introduction (1)
(02:42) RSP Evaluations (2)
(03:47) Move That Goalpost
(05:41) The Failures Are News
(07:33) Alignment Risk Slowly Rises
(09:00) New Risk Pathways Just Dropped
(11:26) Cyber (3)
(12:22) Harmful Requests (4.1)
(14:23) We Need To Talk (4.2 and 4.3)
(17:36) Overcoming Bias (4.4)
(19:33) Agentic Safety (5)
(21:40) Prompt Injection (5.2)
(25:18) Alignment (6)
(26:33) Looking For Problems
(27:55) Who Watches The Training (6.2.2)
(32:07) Automated Behavioral Audit
(32:47) The Model Is Smarter Than The Eval (6.2.3.2)
(34:39) You Should See The Other Guy
(36:30) UK AISI Testing (6.2.4)
(36:50) In Vendbench (6.2.5)
(39:27) Honesty (6.3.3 to 6.3.6)
(41:35) Chain of Thought (CoT) Monitorability (6.5)
(44:09) What's In The Box? (6.6)
(45:57) That's All For Now
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
By zvi
