Joe Carlsmith Audio

Takes on "Alignment Faking in Large Language Models"


Listen Later

What can we learn from recent empirical demonstrations of scheming in frontier models? Text version here: https://joecarlsmith.com/2024/12/18/takes-on-alignment-faking-in-large-language-models/

...more
View all episodesView all episodes
Download on the App Store

Joe Carlsmith AudioBy Joe Carlsmith

  • 5
  • 5
  • 5
  • 5
  • 5

5

15 ratings


More shows like Joe Carlsmith Audio

View all
The Joe Rogan Experience by Joe Rogan

The Joe Rogan Experience

229,020 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,461 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

531 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

142 Listeners