
Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) Anything". Jonas explains how neural networks can be exploited, highlighting the risk of deploying LLM agents that interact with the real world. We discuss the role of open models in enabling security research, the challenges of optimizing over certain constraints, and the ongoing difficulties in achieving robustness in neural networks. Finally, we delve into the future of AI security, and the need for a better approach to mitigate the risks posed by optimized adversarial attacks.
The complete show notes for this episode can be found at twimlai.com/go/678.