Legal4Tech - The podcast

Adversarial Poetry: How Language Can Jailbreak AI with Federico Sartore and Matteo Prandi



In this episode of Legal4Tech - The Podcast, we chat with Federico Sartore and Matteo Prandi to unpack their groundbreaking research on AI safety.

We explore:

  • How poetry can be used as a universal jailbreak for large language models
  • Why adversarial language bypasses current safety filters
  • The limits of today's AI alignment and content moderation
  • What it means for regulation, ethics and the future of AI security

Join us for a deep dive into the hidden vulnerabilities of generative AI and the surprising power of humanistic approaches to tech.


Show notes:

  1. Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models

By Legal4Tech