February 18, 2021

BI 098 Brian Christian: The Alignment Problem

1 hour 32 minutes

Brian and I discuss a range of topics related to his latest book, The Alignment Problem: Machine Learning and Human Values. The alignment problem asks how we can build AI that does what we want it to do, as opposed to building AI that will compromise our own values by accomplishing tasks that may be harmful or dangerous to us. Using some of the stories Brian relates in the book, we talk about:

The history of machine learning and how we got to this point;
Some methods researchers are creating to understand what's being represented in neural nets and how they generate their output;
Some modern proposed solutions to the alignment problem, like programming the machines to learn our preferences so they can help achieve those preferences - an idea called inverse reinforcement learning;
The thorny issue of accurately knowing our own values: if we get those wrong, will machines also get it wrong?

Links:

Brian's website.
Twitter: @brianchristian.
The Alignment Problem: Machine Learning and Human Values.
Related papers
- Norbert Wiener from 1960: Some Moral and Technical Consequences of Automation.

Timestamps:

4:22 - Increased work on AI ethics

8:59 - The Alignment Problem overview

12:36 - Stories as important for intelligence

16:50 - What is the alignment problem?

17:37 - Who works on the alignment problem?

25:22 - AI ethics degree?

29:03 - Human values

31:33 - AI alignment and evolution

37:10 - Knowing our own values?

46:27 - What have we learned about ourselves?

58:51 - Interestingness

1:00:53 - Inverse RL for value alignment

1:04:50 - Current progress

1:10:08 - Developmental psychology

1:17:36 - Models as the danger

1:25:08 - How worried are the experts?

Brain Inspired

By Paul Middlebrooks

4.8

134 ratings
