
Sign up to save your podcasts
Or
I'm aware of good arguments that this scenario isn't inevitable, but it still seems frighteningly likely even if we solve technical alignment.
TL;DR:
The first AGIs will probably be aligned to take orders
People in charge of AGI projects like power. And by definition, they like their values somewhat better than the aggregate values of all of humanity. It also seems like there's a pretty strong argument that Instruction-following AGI is easier than value aligned AGI. In the slow-ish takeoff we expect, this alignment target seems to allow for error-correcting alignment [...]
---
Outline:
(00:43) The first AGIs will probably be aligned to take orders
(01:27) The first AGI probably wont perform a pivotal act
(02:30) So RSI-capable AGI may proliferate until a disaster occurs
(03:51) Counterarguments
The original text contained 1 footnote which was omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
I'm aware of good arguments that this scenario isn't inevitable, but it still seems frighteningly likely even if we solve technical alignment.
TL;DR:
The first AGIs will probably be aligned to take orders
People in charge of AGI projects like power. And by definition, they like their values somewhat better than the aggregate values of all of humanity. It also seems like there's a pretty strong argument that Instruction-following AGI is easier than value aligned AGI. In the slow-ish takeoff we expect, this alignment target seems to allow for error-correcting alignment [...]
---
Outline:
(00:43) The first AGIs will probably be aligned to take orders
(01:27) The first AGI probably wont perform a pivotal act
(02:30) So RSI-capable AGI may proliferate until a disaster occurs
(03:51) Counterarguments
The original text contained 1 footnote which was omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,409 Listeners
2,387 Listeners
7,908 Listeners
4,131 Listeners
87 Listeners
1,457 Listeners
9,042 Listeners
87 Listeners
388 Listeners
5,432 Listeners
15,201 Listeners
474 Listeners
122 Listeners
75 Listeners
454 Listeners