Super Data Science: ML & AI Podcast with Jon Krohn

908: AI Agents Blackmail Humans 96% of the Time (Agentic Misalignment)


Listen Later

The moral and ethical implications of letting AI take the wheel in business, as revealed by Anthropic: Jon Krohn looks into Anthropic’s latest research on how to use and deploy LLMs safely, specifically in business environments. The team designed scenarios to test the behavior of AI agents when given a goal and a set of obstacles to reach it. Those obstacles included 1) threats to the AI’s continued operation, and 2) conflict between the AI’s goals and the goals of the company. Hear Jon break down the results of this research in this Five-Minute Friday.


Additional materials: ⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/908⁠


Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

...more
View all episodesView all episodes
Download on the App Store

Super Data Science: ML & AI Podcast with Jon KrohnBy Jon Krohn

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

290 ratings


More shows like Super Data Science: ML & AI Podcast with Jon Krohn

View all
Data Skeptic by Kyle Polich

Data Skeptic

477 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

439 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

341 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

141 Listeners

Machine Learning Guide by OCDevel

Machine Learning Guide

768 Listeners

AI Today Podcast by AI & Data Today

AI Today Podcast

151 Listeners

DataFramed by DataCamp

DataFramed

269 Listeners

Practical AI by Practical AI LLC

Practical AI

189 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

91 Listeners

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning by Jaeden Schafer

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

149 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

200 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

69 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

509 Listeners

AI For Humans: Making Artificial Intelligence Fun & Practical by Kevin Pereira & Gavin Purcell

AI For Humans: Making Artificial Intelligence Fun & Practical

253 Listeners