September 18, 2023

Attacking LLMs for fun and profit (Ep. 239)

Listen Later

22 minutes

As a continuation of Episode 238, I explain some effective and fun attacks to conduct against LLMs. Such attacks are even more effective on models served locally, that are hardly controlled by human feedback.

Have great fun and learn them responsibly.

References

https://www.jailbreakchat.com/

https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/

https://arxiv.org/abs/2305.13860

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Data Science at Home

By Francesco Gadaleta

4.2

7272 ratings

September 18, 2023

Attacking LLMs for fun and profit (Ep. 239)

Listen Later

22 minutes

As a continuation of Episode 238, I explain some effective and fun attacks to conduct against LLMs. Such attacks are even more effective on models served locally, that are hardly controlled by human feedback.

Have great fun and learn them responsibly.

References

https://www.jailbreakchat.com/

https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/

https://arxiv.org/abs/2305.13860

...more

More shows like Data Science at Home

On Point with Meghna Chakrabarti by WBUR

On Point with Meghna Chakrabarti

4,022 Listeners

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,380 Listeners

Nature Podcast by Springer Nature Limited

Nature Podcast

756 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

626 Listeners

Science Vs by Spotify Studios

Science Vs

12,130 Listeners

Science Friday by Science Friday and WNYC Studios

Science Friday

6,467 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

306 Listeners

The Daily by The New York Times

The Daily

113,121 Listeners

Up First from NPR by NPR

Up First from NPR

56,944 Listeners

The Atlantic Interview by The Atlantic

The Atlantic Interview

14 Listeners

Modern Wisdom by Chris Williamson

Modern Wisdom

4,025 Listeners

The Peter Attia Drive by Peter Attia, MD

The Peter Attia Drive

8,043 Listeners

Practical AI by Practical AI LLC

Practical AI

212 Listeners

Consider This from NPR by NPR

Consider This from NPR

6,462 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,525 Listeners