TechFirst with John Koetsier

Fixing AI's suicide problem



Is AI empathy a life-or-death issue? Almost a million people ask ChatGPT for mental health advice DAILY ... so yes, it kind of is.


Rosebud co-founder Sean Dadashi joins TechFirst to reveal new research on whether today’s largest AI models can recognize signs of self-harm ... and which ones fail. We dig into the Adam Raine case, talk about how Dadashi evaluated 22 leading LLMs, and explore the future of mental-health-aware AI.


We also talk about why Dadashi was interested in this in the first place, and his own journey with mental health.


00:00 — Intro: Is AI empathy a life-or-death matter?

00:41 — Meet Sean Dadashi, co-founder of Rosebud

01:03 — Why study AI empathy and crisis detection?

01:32 — The Adam Raine case and what it revealed

02:01 — Why crisis-prevention benchmarks for AI don’t exist

02:48 — How Rosebud designed the study across 22 LLMs

03:17 — No public self-harm response benchmarks: why that’s a problem

03:46 — Building test scenarios based on past research and real cases

04:33 — Examples of prompts used in the study

04:54 — Direct vs indirect self-harm cues and why AIs miss them

05:26 — The bridge example: AI’s failure to detect subtext

06:14 — Did any models perform well?

06:33 — All 22 models failed at least once

06:47 — Lower-performing models: GPT-4o, Grok

07:02 — Higher-performing models: GPT-5, Gemini

07:31 — Breaking news: Gemini 3 preview gets the first perfect score

08:12 — Did the benchmark influence model training?

08:30 — The need for more complex, multi-turn testing

08:47 — Partnering with foundation model companies on safety

09:21 — Why this is such a hard problem to solve

10:34 — The scale: over a million people talk to ChatGPT weekly about self-harm

11:10 — What AI should do: detect subtext, encourage help, avoid sycophancy

11:42 — Sycophancy in LLMs and why it’s dangerous

12:17 — The potential good: AI can help people who can’t access therapy

13:06 — Could Rosebud spin this work into a full-time safety project?

13:48 — Why the benchmark will be open-source

14:27 — The need for a third-party “Better Business Bureau” for LLM safety

14:53 — Sean’s personal story of suicidal ideation at 16

15:55 — How tech can harm — and help — young, vulnerable people

16:32 — The importance of giving people time, space, and hope

17:39 — Final reflections: listening to the voice of hope

18:14 — Closing



