
Sign up to save your podcasts
Or


This talk examines whether advanced AIs that perform well in training will be doing so in order to gain power later — a behavior Joe Carlsmith calls "scheming" (also often called "deceptive alignment"). This talk gives an overview of his recent report on the topic, available on arXiv here: https://arxiv.org/abs/2311.08379.
Watch on Youtube: https://www.youtube.com/watch?v=AxUTiGS6BHM
By Aaron BergmanThis talk examines whether advanced AIs that perform well in training will be doing so in order to gain power later — a behavior Joe Carlsmith calls "scheming" (also often called "deceptive alignment"). This talk gives an overview of his recent report on the topic, available on arXiv here: https://arxiv.org/abs/2311.08379.
Watch on Youtube: https://www.youtube.com/watch?v=AxUTiGS6BHM