August 05, 2024

AI is learning how to lie

Listen Later

11 minutes

Large language models go through a lot of vetting before they’re released to the public. That includes safety tests, bias checks, ethical reviews and more. But what if, hypothetically, a model could dodge a safety question by lying to developers, hiding its real response to a safety test and instead giving the exact response its human handlers are looking for? A recent study shows that advanced LLMs are developing the capacity for deception, and that could bring that hypothetical situation closer to reality. Marketplace’s Lily Jamali speaks with Thilo Hagendorff, a researcher at the University of Stuttgart and the author of the study, about his findings.

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Marketplace Tech

By Marketplace

4.4

7373 ratings

August 05, 2024

AI is learning how to lie

Listen Later

11 minutes

Large language models go through a lot of vetting before they’re released to the public. That includes safety tests, bias checks, ethical reviews and more. But what if, hypothetically, a model could dodge a safety question by lying to developers, hiding its real response to a safety test and instead giving the exact response its human handlers are looking for? A recent study shows that advanced LLMs are developing the capacity for deception, and that could bring that hypothetical situation closer to reality. Marketplace’s Lily Jamali speaks with Thilo Hagendorff, a researcher at the University of Stuttgart and the author of the study, about his findings.

...more

More shows like Marketplace Tech

The NPR Politics Podcast by NPR

The NPR Politics Podcast

25,785 Listeners

Marketplace by Marketplace

Marketplace

8,767 Listeners

On the Media by WNYC Studios

On the Media

9,190 Listeners

PBS News Hour - Brooks and Capehart by PBS News

PBS News Hour - Brooks and Capehart

1,220 Listeners

The World by PRX

The World

937 Listeners

Fareed Zakaria GPS by CNN Podcasts

Fareed Zakaria GPS

3,404 Listeners

Marketplace Morning Report by Marketplace

Marketplace Morning Report

925 Listeners

Marketplace All-in-One by Marketplace

Marketplace All-in-One

1,388 Listeners

Marketplace Tech by Marketplace

Marketplace Tech

1,276 Listeners

Make Me Smart by Marketplace

Make Me Smart

5,489 Listeners

The Indicator from Planet Money by NPR

The Indicator from Planet Money

9,541 Listeners

Composers Datebook by American Public Media

Composers Datebook

10 Listeners

Piano Puzzler by American Public Media

Piano Puzzler

35 Listeners

Consider This from NPR by NPR

Consider This from NPR

6,403 Listeners

How We Survive by Marketplace

How We Survive

1,378 Listeners

Trump's Terms by NPR

Trump's Terms

398 Listeners

This Old House Radio Hour by American Public Media

This Old House Radio Hour

95 Listeners