AI is naturally prone to being tricked into behaving badly, but researchers are working hard to patch that weakness. Andy Zou and Jason Hausenloy — Apr 9, 2025 Dan Hendrycks, the founder and Director of the Center for AI Safety (which funds this publication), was a co-author on the paper that introduced "circuit breakers," a machine learning technique discussed in this article. Image: Getty Images

AI is naturally prone to being tricked into behaving badly, but researchers are working hard to patch that weakness. Andy Zou and Jason Hausenloy — Apr 9, 2025 Dan Hendrycks, the founder and Director of the Center for AI Safety (which funds this publication), was a co-author on the paper that introduced "circuit breakers," a machine learning technique discussed in this article. Image: Getty Images

Can We Stop Bad Actors From Manipulating AI?

AI Frontiers is a platform for expert dialogue and debate on the impacts of artificial intelligence.

Sign up for our newsletter: 
https://ai-frontiers.org/subscribe

News

Technology

News Commentary

AI Frontiers is a platform for expert dialogue and debate on the impacts of artificial intelligence. Sign up for our newsletter: https://ai-frontiers.org/subscribe

Share Can We Stop Bad Actors From Manipulating AI?

Sign up to save your podcasts

Can We Stop Bad Actors From Manipulating AI?

Can We Stop Bad Actors From Manipulating AI?