May 24, 2024

“minutes from a human-alignment meeting” by bhauth

Listen Later

4 minutes

"OK, let's get this meeting started. We're all responsible for development of this new advanced intelligence 'John'. We want John to have some kids with our genes, instead of just doing stuff like philosophy or building model trains, and this meeting is to discuss how we can ensure John tries to do that."

"It's just a reinforcement learning problem, isn't it? We want kids to happen, so provide positive reinforcement when that happens."

"How do we make sure the kids are ours?"

"There's a more fundamental problem than that: without intervention earlier on, that positive reinforcement will never happen."

"OK, so we need some guidance earlier on. Any suggestions?"

"To start, having other people around is necessary. How about some negative reinforcement if there are no other humans around for some period of time?"

"That's a good one, also helps with some other [...]

---

First published:

May 24th, 2024

Source:

https://www.lesswrong.com/posts/SN3BjoizdbvZG5J6a/minutes-from-a-human-alignment-meeting

---

Narrated by TYPE III AUDIO.

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

LessWrong (30+ Karma)

By LessWrong

May 24, 2024

“minutes from a human-alignment meeting” by bhauth

Listen Later

4 minutes

"OK, let's get this meeting started. We're all responsible for development of this new advanced intelligence 'John'. We want John to have some kids with our genes, instead of just doing stuff like philosophy or building model trains, and this meeting is to discuss how we can ensure John tries to do that."

"It's just a reinforcement learning problem, isn't it? We want kids to happen, so provide positive reinforcement when that happens."

"How do we make sure the kids are ours?"

"There's a more fundamental problem than that: without intervention earlier on, that positive reinforcement will never happen."

"OK, so we need some guidance earlier on. Any suggestions?"

"To start, having other people around is necessary. How about some negative reinforcement if there are no other humans around for some period of time?"

"That's a good one, also helps with some other [...]

---

First published:

May 24th, 2024

Source:

https://www.lesswrong.com/posts/SN3BjoizdbvZG5J6a/minutes-from-a-human-alignment-meeting

---

Narrated by TYPE III AUDIO.

...more

More shows like LessWrong (30+ Karma)

The Daily by The New York Times

The Daily

112,909 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,215 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

532 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,221 Listeners

AI Article Readings by Readings of great articles in AI voices

AI Article Readings

4 Listeners

Doom Debates by Liron Shapira

Doom Debates

14 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

2 Listeners