March 11, 2026
JBDistill Generates Its Own Jailbreaks - 81.8% Attack Rate
By Awesome Agents
4 minutes
JBDistill, from Johns Hopkins and Microsoft, achieves an 81.8% attack success rate across 13 LLMs by auto-generating fresh adversarial prompts on demand.