LessWrong (30+ Karma)

“minutes from a human-alignment meeting” by bhauth


Listen Later

"OK, let's get this meeting started. We're all responsible for development of this new advanced intelligence 'John'. We want John to have some kids with our genes, instead of just doing stuff like philosophy or building model trains, and this meeting is to discuss how we can ensure John tries to do that."

"It's just a reinforcement learning problem, isn't it? We want kids to happen, so provide positive reinforcement when that happens."

"How do we make sure the kids are ours?"

"There's a more fundamental problem than that: without intervention earlier on, that positive reinforcement will never happen."

"OK, so we need some guidance earlier on. Any suggestions?"

"To start, having other people around is necessary. How about some negative reinforcement if there are no other humans around for some period of time?"

"That's a good one, also helps with some other [...]

---

First published:

May 24th, 2024

Source:

https://www.lesswrong.com/posts/SN3BjoizdbvZG5J6a/minutes-from-a-human-alignment-meeting

---

Narrated by TYPE III AUDIO.

...more
View all episodesView all episodes
Download on the App Store

LessWrong (30+ Karma)By LessWrong


More shows like LessWrong (30+ Karma)

View all
The Daily by The New York Times

The Daily

112,909 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,215 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

532 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,221 Listeners

AI Article Readings by Readings of great articles in AI voices

AI Article Readings

4 Listeners

Doom Debates by Liron Shapira

Doom Debates

14 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

2 Listeners