
Sign up to save your podcasts
Or


We're creating incentives for AI systems to make their behavior look as desirable as possible, while intentionally disregarding human intent when that conflicts with maximizing reward: https://www.planned-obsolescence.org/the-training-game/
By Ajeya CotraWe're creating incentives for AI systems to make their behavior look as desirable as possible, while intentionally disregarding human intent when that conflicts with maximizing reward: https://www.planned-obsolescence.org/the-training-game/

26,260 Listeners

9,645 Listeners

87,532 Listeners

112,220 Listeners

56,587 Listeners

5,533 Listeners

9 Listeners

16,196 Listeners

12 Listeners

3,512 Listeners