Clearer Thinking with Spencer Greenberg

How can AIs know what we want if *we* don't even know? (with Geoffrey Irving)


Read the full transcript here.

What does it really mean to align an AI system with human values? What would a powerful AI need to do in order to do "what we want"? How does being an assistant differ from being an agent? Could inter-AI debate work as an alignment strategy, or would it just result in arguments designed to manipulate humans via their cognitive and emotional biases? How can we make sure that all human values are learned by AIs, not just the values of humans in WEIRD societies? Are our current state-of-the-art LLMs politically left-leaning? How can alignment strategies take into account the fact that our individual and collective values occasionally change over time?

Geoffrey Irving is an AI safety researcher at DeepMind. Before that, he led the Reflection Team at OpenAI, was involved in neural network theorem proving at Google Brain, cofounded Eddy Systems to autocorrect code as you type, and worked on computational physics and geometry at Otherlab, D. E. Shaw Research, Pixar, and Weta Digital. He has screen credits on Ratatouille, WALL•E, Up, and Tintin. Learn more about him at his website, naml.us.

Further reading

  • Gandalf: An Educational Game Demonstrating Security Vulnerabilities in Large Language Models
  • "AI safety via debate"
  • "Claude's Constitution"

Staff

  • Spencer Greenberg — Host / Director
  • Josh Castle — Producer
  • Ryan Kessler — Audio Engineer
  • Uri Bram — Factotum
  • WeAmplify — Transcriptionists

Music

  • Broke for Free
  • Josh Woodward
  • Lee Rosevere
  • Quiet Music for Tiny Robots
  • wowamusic
  • zapsplat.com

Affiliates

  • Clearer Thinking
  • GuidedTrack
  • Mind Ease
  • Positly
  • UpLift

Rating: 4.8 (126 ratings)

