The Quanta Podcast

AI Filters Will Always Have Holes


Listen Later

Ask ChatGPT how to build a bomb, and it will flatly respond that it “can’t help with that.” But users have long played a cat-and-mouse game to try to trick language models into providing forbidden information. Just as quickly as these “jailbreaks” appear, AI companies patch them by simply filtering out forbidden prompts before they ever reach the model itself.

Recently, cryptographers have shown how the defensive filters put around powerful language models can be subverted by well-studied cryptographic tools. In fact, they’ve shown how the very nature of this two-tier system — a filter that protects a powerful language model inside it — creates gaps in the defenses that can always be exploited. In this episode, Quanta executive editor Michael Moyer tells Samir Patel about the findings and implications of this new work.

Audio coda courtesy of Banana Breakdown.

...more
View all episodesView all episodes
Download on the App Store

The Quanta PodcastBy Quanta Magazine

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

515 ratings


More shows like The Quanta Podcast

View all
Planetary Radio: Space Exploration, Astronomy and Science by The Planetary Society

Planetary Radio: Space Exploration, Astronomy and Science

1,357 Listeners

SpaceTime with Stuart Gary by Stuart Gary

SpaceTime with Stuart Gary

320 Listeners

Ask a Spaceman! by Paul M. Sutter

Ask a Spaceman!

844 Listeners

Universe Today Podcast by Fraser Cain

Universe Today Podcast

563 Listeners

Space Nuts: Astronomy Insights & Cosmic Discoveries by Professor Fred Watson and Andrew Dunkley

Space Nuts: Astronomy Insights & Cosmic Discoveries

238 Listeners

Science Magazine Podcast by Science Magazine

Science Magazine Podcast

827 Listeners

Into the Impossible With Brian Keating by Big Bang Productions Inc.

Into the Impossible With Brian Keating

1,070 Listeners

The Michael Shermer Show by Michael Shermer

The Michael Shermer Show

941 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll | Wondery

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,198 Listeners

Daniel and Kelly’s Extraordinary Universe by iHeartPodcasts

Daniel and Kelly’s Extraordinary Universe

2,361 Listeners

The Origins Podcast with Lawrence Krauss by Lawrence M. Krauss

The Origins Podcast with Lawrence Krauss

507 Listeners

The Joy of x by Steven Strogatz and Quanta Magazine

The Joy of x

252 Listeners

The Supermassive Podcast by The Royal Astronomical Society

The Supermassive Podcast

324 Listeners

Theories of Everything with Curt Jaimungal by Theories of Everything

Theories of Everything with Curt Jaimungal

21 Listeners

Why This Universe? by Dan Hooper, Shalma Wegsman

Why This Universe?

388 Listeners

The Joy of Why by Steven Strogatz, Janna Levin and Quanta Magazine

The Joy of Why

488 Listeners