80k After Hours

Highlights: #214 – Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway


Listen Later

Most AI safety conversations centre on alignment: ensuring AI systems share our values and goals. But despite progress, we’re unlikely to know we’ve solved the problem before the arrival of human-level and superhuman systems in as little as three years.

So some — including Buck Shlegeris, CEO of Redwood Research — are developing a backup plan to safely deploy models we fear are actively scheming to harm us: so-called “AI control.” While this may sound mad, given the reluctance of AI companies to delay deploying anything they train, not developing such techniques is probably even crazier.

These highlights are from episode #214 of The 80,000 Hours Podcast: Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway, and include:

  • What is AI control? (00:00:15)
  • One way to catch AIs that are up to no good (00:07:00)
  • What do we do once we catch a model trying to escape? (00:13:39)
  • Team Human vs Team AI (00:18:24)
  • If an AI escapes, is it likely to be able to beat humanity from there? (00:24:59)
  • Is alignment still useful? (00:32:10)
  • Could 10 safety-focused people in an AGI company do anything useful? (00:35:34)

These aren't necessarily the most important or even most entertaining parts of the interview — so if you enjoy this, we strongly recommend checking out the full episode!

And if you're finding these highlights episodes valuable, please let us know by emailing [email protected].

Highlights put together by Ben Cordell, Milo McGuire, and Dominic Armstrong

...more
View all episodesView all episodes
Download on the App Store

80k After HoursBy The 80000 Hours team

  • 5
  • 5
  • 5
  • 5
  • 5

5

14 ratings


More shows like 80k After Hours

View all
EconTalk by Russ Roberts

EconTalk

4,208 Listeners

Very Bad Wizards by Tamler Sommers & David Pizarro

Very Bad Wizards

2,638 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,385 Listeners

The Good Fight by Yascha Mounk

The Good Fight

892 Listeners

The Joe Walker Podcast by Joe Walker

The Joe Walker Podcast

120 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll | Wondery

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,098 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

332 Listeners

Decoding the Gurus by Christopher Kavanagh and Matthew Browne

Decoding the Gurus

924 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

136 Listeners

The Rest Is History by Goalhanger

The Rest Is History

12,116 Listeners

Empire by Goalhanger

Empire

2,016 Listeners

Past Present Future by David Runciman

Past Present Future

282 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

139 Listeners

The Studies Show by Tom Chivers and Stuart Ritchie

The Studies Show

61 Listeners

Lives Well Lived by Peter Singer & Kasia de Lazari Radek

Lives Well Lived

36 Listeners