
Sign up to save your podcasts
Or


We're creating incentives for AI systems to make their behavior look as desirable as possible, while intentionally disregarding human intent when that conflicts with maximizing reward: https://www.planned-obsolescence.org/the-training-game/
By Ajeya CotraWe're creating incentives for AI systems to make their behavior look as desirable as possible, while intentionally disregarding human intent when that conflicts with maximizing reward: https://www.planned-obsolescence.org/the-training-game/

26,380 Listeners

9,724 Listeners

87,868 Listeners

113,121 Listeners

56,944 Listeners

5,576 Listeners

9 Listeners

16,525 Listeners

13 Listeners

3,538 Listeners