Don't Worry About the Vase Podcast

Claude Opus 4.5: Model Card, Alignment and Safety


Listen Later

Podcast episode for Claude Opus 4.5: Model Card, Alignment and Safety.

* 00:00:00 - Introduction

* 00:01:50 - Claude Opus 4.5 Basic Facts

* 00:03:26 - Claude Opus 4.5 Is The Best Model For Many But Not All Use Cases

* 00:06:02 - Misaligned?

* 00:09:39 - Section 3: Safeguards and Harmlessness

* 00:11:46 - Section 4: Honesty

* 00:13:27 - 5: Agentic Safety

* 00:21:01 - Section 6: Alignment Overview

* 00:29:55 - Alignment Investigations

* 00:30:35 - Sycophancy Course Correction Is Lacking

* 00:31:52 - Deception

* 00:34:29 - Ruling Out Encoded Content In Chain Of Thought

* 00:37:19 - Sandbagging

* 00:38:10 - Evaluation Awareness

* 00:42:18 - Reward Hacking

* 00:43:59 - Subversion Strategy

* 00:45:30 - 6.13: UK AISI External Testing

* 00:45:39 - 6.14: Model Welfare

* 00:46:33 - 7: RSP Evaluations

* 00:48:12 - CBRN

* 00:56:36 - Autonomy

* 01:04:27 - Cyber

* 01:10:32 - The Whisperers Love The Vibes

The Don’t Worry About the Vase Podcast is a listener-supported podcast. To receive new posts and support the cost of creation, consider becoming a free or paid subscriber.

https://open.substack.com/pub/thezvi/p/claude-opus-45-model-card-alignment?r=67y1h&utm_campaign=post&utm_medium=web&showWelcomeOnShare=false



Get full access to DWAtV Podcast at dwatvpodcast.substack.com/subscribe
...more
View all episodesView all episodes
Download on the App Store

Don't Worry About the Vase PodcastBy Podcast for Zvi's blog, Don't Worry About the Vase Podcast

  • 4.5
  • 4.5
  • 4.5
  • 4.5
  • 4.5

4.5

6 ratings


More shows like Don't Worry About the Vase Podcast

View all
Odd Lots by Bloomberg

Odd Lots

1,960 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,461 Listeners

Decoder with Nilay Patel by The Verge

Decoder with Nilay Patel

3,138 Listeners

ChinaTalk by Jordan Schneider

ChinaTalk

289 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

97 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

528 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

505 Listeners

Hard Fork by The New York Times

Hard Fork

5,529 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

142 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

629 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

151 Listeners

Prof G Markets by Vox Media Podcast Network

Prof G Markets

1,425 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

134 Listeners

The Marginal Revolution Podcast by Mercatus Center at George Mason University

The Marginal Revolution Podcast

93 Listeners

OpenAI Podcast by OpenAI

OpenAI Podcast

51 Listeners