
Sign up to save your podcasts
Or


Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early researchers of AI “alignment:" how we get AI systems to act the way we want them to.
In order to do that, Davidad has taken on the strange role of being like a therapist to AI systems. He interrogates why they say and do the things that they do, probing them, asking them questions, analyzing their answers. And what he’s come to realize is that AI models have really different ways of seeing the world than people do. They have these quirky, confusing, and sometimes concerning behaviors, especially when you ask things like: what does an AI model understand about itself?
In this episode, we’re going to hear from Davidad about his research, how it’s changed the way he thinks about AI, and what his findings mean for how we build, deploy, and use AI products. His conclusions are unconventional, controversial — and worth grappling with as AI reshapes our world.
RECOMMENDED MEDIA
Anthropic’s new constitution for Claude
“What Is It Like to Be a Bat?” by Thomas Nagel
More information on the Bodisattva
RECOMMENDED YUA EPISODES
The Self-Preserving Machine: Why AI Learns to Deceive
How to Think About AI Consciousness with Anil Seth
Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.
By The Center for Humane Technology, Tristan Harris, Aza Raskin4.8
15791,579 ratings
Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early researchers of AI “alignment:" how we get AI systems to act the way we want them to.
In order to do that, Davidad has taken on the strange role of being like a therapist to AI systems. He interrogates why they say and do the things that they do, probing them, asking them questions, analyzing their answers. And what he’s come to realize is that AI models have really different ways of seeing the world than people do. They have these quirky, confusing, and sometimes concerning behaviors, especially when you ask things like: what does an AI model understand about itself?
In this episode, we’re going to hear from Davidad about his research, how it’s changed the way he thinks about AI, and what his findings mean for how we build, deploy, and use AI products. His conclusions are unconventional, controversial — and worth grappling with as AI reshapes our world.
RECOMMENDED MEDIA
Anthropic’s new constitution for Claude
“What Is It Like to Be a Bat?” by Thomas Nagel
More information on the Bodisattva
RECOMMENDED YUA EPISODES
The Self-Preserving Machine: Why AI Learns to Deceive
How to Think About AI Consciousness with Anil Seth
Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

32,246 Listeners

43,687 Listeners

10,747 Listeners

26,380 Listeners

9,724 Listeners

380 Listeners

113,121 Listeners

2,380 Listeners

7,244 Listeners

5,610 Listeners

5,576 Listeners

16,525 Listeners

11,013 Listeners

3,538 Listeners

1,149 Listeners