


How do you extract prohibited information from ChatGPT? The Grandma and DAN exploits trick language models into violating their own policies. This episode covers why these techniques work, what they reveal about LLM architecture, and how companies protect against prompt injection attacks. Solo episode on LLM security.
To stay in touch, sign up for our newsletter at https://www.superprompt.fm
By Tony Wan
