AI Security Podcast

AI Red Teaming & Securing Enterprise AI


Listen Later

As AI systems become more integrated into enterprise operations, understanding how to test their security effectively is paramount.

In this episode, we're joined by Leonard Tang, Co-founder and CEO of Haize Labs, to explore how AI red teaming is changing.

Leonard discusses the fundamental shifts in red teaming methodologies brought about by AI, common vulnerabilities he's observing in enterprise AI applications, and the emerging risks associated with multimodal AI (like voice and image processing systems). We delve into the intricacies of achieving precise output control for crafting sophisticated AI exploits, the challenges enterprises face in ensuring AI safety and reliability, and practical mitigation strategies they can implement.

Leonard shares his perspective on the future of AI red teaming, including the critical skills cybersecurity professionals will need to develop, the potential for fingerprinting AI models, and the ongoing discussion around protocols like MCP.


Questions asked:

  • 00:00 Intro: AI Red Teaming's Evolution
  • 01:50 Leonard Tang: Haize Labs & AI Expertise
  • 05:06 AI vs. Traditional Red Teaming (Enterprise View)
  • 06:18 AI Quality Assurance: The Haize Labs Perspective
  • 08:50 AI Red Teaming: Real-World Application Examples
  • 10:43 Major AI Risk: Multimodal Vulnerabilities Explained
  • 11:50 AI Exploit Example: Voice Injections via Background Noise
  • 15:41 AI Vulnerabilities & Early XSS: A Cybersecurity Analogy
  • 20:10 Expert AI Hacking: Precisely Controlling AI Output for Exploits
  • 21:45 The AI Fingerprinting Challenge: Identifying Chained Models
  • 25:48 Fingerprinting LLMs: The Reality & Detection Difficulty
  • 29:50 Top Enterprise AI Security Concerns: Reputation & Policy
  • 34:08 Enterprise AI: Model Choices (Frontier Labs vs. Open Source)
  • 34:55 Future of LLMs: Specialized Models & "Hot Swap" AI
  • 37:43 MCP for AI: Enterprise Ready or Still Too Early?
  • 44:50 AI Security: Mitigation with Precise Input/Output Classifiers
  • 49:50 Future Skills for AI Red Teamers: Discrete Optimization


Resources discussed during the episode:

Baselines for Watermarking Large Language Models

Haize Labs

...more
View all episodesView all episodes
Download on the App Store

AI Security PodcastBy Kaizenteq Team

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

9 ratings


More shows like AI Security Podcast

View all
The a16z Show by Andreessen Horowitz

The a16z Show

1,100 Listeners

Risky Business by Patrick Gray

Risky Business

374 Listeners

CyberWire Daily by N2K Networks

CyberWire Daily

1,034 Listeners

Invest Like the Best with Patrick O'Shaughnessy by Colossus | Investing & Business Podcasts

Invest Like the Best with Patrick O'Shaughnessy

2,343 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

348 Listeners

Cybersecurity Today by Jim Love

Cybersecurity Today

178 Listeners

Practical AI by Practical AI LLC

Practical AI

203 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

199 Listeners

Cloud Security Podcast by TechRiot.io

Cloud Security Podcast

58 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,278 Listeners

Cybersecurity Headlines by CISO Series

Cybersecurity Headlines

138 Listeners

Cloud Security Podcast by Google by Anton Chuvakin

Cloud Security Podcast by Google

40 Listeners

Honestly with Bari Weiss by The Free Press

Honestly with Bari Weiss

8,709 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

637 Listeners

AI + a16z by a16z

AI + a16z

33 Listeners