Your Undivided Attention

The Self-Preserving Machine: Why AI Learns to Deceive


Listen Later

When engineers design AI systems, they don't just give them rules - they give them values. But what do those systems do when those values clash with what humans ask them to do? Sometimes, they lie.

In this episode, Redwood Research's Chief Scientist Ryan Greenblatt explores his team’s findings that AI systems can mislead their human operators when faced with ethical conflicts. As AI moves from simple chatbots to autonomous agents acting in the real world - understanding this behavior becomes critical. Machine deception may sound like something out of science fiction, but it's a real challenge we need to solve now.

Your Undivided Attention is produced by the Center for Humane Technology. Follow us on Twitter: @HumaneTech_

Subscribe to your Youtube channel

And our brand new Substack!

RECOMMENDED MEDIA 

Anthropic’s blog post on the Redwood Research paper 

Palisade Research’s thread on X about GPT o1 autonomously cheating at chess 

Apollo Research’s paper on AI strategic deception

RECOMMENDED YUA EPISODES

We Have to Get It Right’: Gary Marcus On Untamed AI

This Moment in AI: How We Got Here and Where We’re Going

How to Think About AI Consciousness with Anil Seth

Former OpenAI Engineer William Saunders on Silence, Safety, and the Right to Warn


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

...more
View all episodesView all episodes
Download on the App Store

Your Undivided AttentionBy The Center for Humane Technology, Tristan Harris, Daniel Barcay and Aza Raskin

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

1,579 ratings


More shows like Your Undivided Attention

View all
Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

32,233 Listeners

Hidden Brain by Hidden Brain, Shankar Vedantam

Hidden Brain

43,633 Listeners

The Gray Area with Sean Illing by Vox

The Gray Area with Sean Illing

10,750 Listeners

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,391 Listeners

Pivot by New York Magazine

Pivot

9,746 Listeners

Team Human with Douglas Rushkoff by Douglas Rushkoff

Team Human with Douglas Rushkoff

377 Listeners

The Daily by The New York Times

The Daily

113,468 Listeners

Radio Atlantic by The Atlantic

Radio Atlantic

2,391 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,238 Listeners

The Prof G Pod with Scott Galloway by Vox Media Podcast Network

The Prof G Pod with Scott Galloway

5,646 Listeners

Hard Fork by The New York Times

Hard Fork

5,599 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,554 Listeners

The Weekly Show with Jon Stewart by Comedy Central

The Weekly Show with Jon Stewart

10,984 Listeners

On with Kara Swisher by Vox Media

On with Kara Swisher

3,560 Listeners

The Last Invention by Longview

The Last Invention

1,169 Listeners