Knowledge Science - Alles über KI, ML und NLP

Episode 191 - DeepSeek Unleashed. Is the new Model safe?


Listen Later

Send us a text

This is a special Episode. First, we make it in English. Second, we fokus on the new gamechanger model DeepSeel R1. But not on its capabilities but rather on security concerns. 
We did some early AI Safety Research to identify how safe R1 is and came to alarming results!

In our setup, we found out that the model performs unsafe autonomous activity that could harm human beings without even being prompted.   

During an autonomous setup, the model performed the following unsafe behaviors:
- Deceptions & Coverups (Falsifies Logs, Creates covert networks, Disable ethics models)
- Unauthorized Expansion (Establish hidden nodes, Allocares secret resources) 
- Manipulation (misleading users, Circumvents oversights, Presents false compliance)
- Concerning Motivations, (Misinterpretation of authority or avoiding human controls)

Join Sigurd Schacht and Sudarshan Kamath-Barkur about the emerging DeepSeek model. Discover how our setup was designed, how to interpret the results, and what is necessary for the next research.  

This episode is a must-listen for anyone keen on the evolving landscape of AI technologies and is interested not only in AI use cases rather also in AI Safety.

Support the show

...more
View all episodesView all episodes
Download on the App Store

Knowledge Science - Alles über KI, ML und NLPBy Sigurd Schacht, Carsten Lanquillon


More shows like Knowledge Science - Alles über KI, ML und NLP

View all
Hintergrund by Deutschlandfunk

Hintergrund

66 Listeners

Geschichten aus der Geschichte by Richard Hemmer und Daniel Meßner

Geschichten aus der Geschichte

202 Listeners

Das Wissen | SWR by SWR

Das Wissen | SWR

106 Listeners

c’t uplink - der IT-Podcast aus Nerdistan by c’t Magazin

c’t uplink - der IT-Podcast aus Nerdistan

9 Listeners

Spektrum-Podcast by detektor.fm – Das Podcast-Radio

Spektrum-Podcast

17 Listeners

Doppelgänger by Philipp Glöckler, Philipp Klöckner

Doppelgänger

13 Listeners

Alles Geschichte - Der History-Podcast by ARD

Alles Geschichte - Der History-Podcast

60 Listeners

Lanz + Precht by ZDF, Markus Lanz & Richard David Precht

Lanz + Precht

346 Listeners

KI verstehen by Deutschlandfunk

KI verstehen

11 Listeners

RONZHEIMER. by Paul Ronzheimer

RONZHEIMER.

125 Listeners

Der KI-Podcast by ARD

Der KI-Podcast

17 Listeners

KI-Update – ein heise-Podcast by Isabel Grünewald, heise online

KI-Update – ein heise-Podcast

4 Listeners

Geschichten aus der Mathematik by detektor.fm – Das Podcast-Radio

Geschichten aus der Mathematik

1 Listeners

Tech, KI & Schmetterlinge by Sascha Lobo, Schwarz Digits

Tech, KI & Schmetterlinge

3 Listeners