Share The Attack Atlas by IBM Research

Copy link

October 05, 2024

The Attack Atlas by IBM Research

11 minutes

This episode explores the intricate world of red-teaming generative AI models as discussed in the paper "Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI." We'll dive into the emerging vulnerabilities as LLMs are increasingly integrated into real-world applications and the evolving tactics of adversarial attacks. Our conversation will center around the "Attack Atlas" - a practical framework that helps practitioners analyze and secure against single-turn input attacks - and we'll examine the critical challenges in both red- and blue-teaming generative AI systems. Whether you’re a security expert or simply fascinated by the defense of cutting-edge AI, this episode breaks down the complexities of safeguarding AI against adversarial threats.

Paper (preprint): Rawat, Ambrish, et al. "Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI." (2024). arXiv.

Disclaimer: This podcast was generated using Google's NotebookLM AI. While the summary aims to provide an overview, it is recommended to refer to the original research preprint for a comprehensive understanding of the study and its findings.

...more

View all episodes

By Arian Abbasi, Alan Aqrawi

October 05, 2024

The Attack Atlas by IBM Research

11 minutes

Paper (preprint): Rawat, Ambrish, et al. "Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI." (2024). arXiv.

...more

Sign up to save your podcasts