Data Science at Home

Attacking LLMs for fun and profit (Ep. 239)


Listen Later

As a continuation of Episode 238, I explain some effective and fun attacks to conduct against LLMs. Such attacks are even more effective on models served locally, that are hardly controlled by human feedback.

Have great fun and learn them responsibly.

 

References

https://www.jailbreakchat.com/

https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/

https://arxiv.org/abs/2305.13860

 

...more
View all episodesView all episodes
Download on the App Store

Data Science at HomeBy Francesco Gadaleta

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

72 ratings


More shows like Data Science at Home

View all
Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

31,971 Listeners

Global News Podcast by BBC World Service

Global News Podcast

7,584 Listeners

WSJ Your Money Briefing by The Wall Street Journal

WSJ Your Money Briefing

1,706 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,091 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

623 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

585 Listeners

Science Magazine Podcast by Science Magazine

Science Magazine Podcast

823 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

FT Tech Tonic by Financial Times

FT Tech Tonic

99 Listeners

Worklife with Adam Grant by TED

Worklife with Adam Grant

9,161 Listeners

Practical AI by Practical AI LLC

Practical AI

207 Listeners

Last Week in AI by Skynet Today

Last Week in AI

306 Listeners

Hard Fork by The New York Times

Hard Fork

5,512 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

The Last Invention by Longview

The Last Invention

1,104 Listeners