80k After Hours

Highlights: #217 – Beth Barnes on the most important graph in AI right now — and the 7-month rule that governs its progress


Listen Later

AI models today have a 50% chance of successfully completing a task that would take an expert human one hour. Seven months ago, that number was roughly 30 minutes — and seven months before that, 15 minutes.

These are substantial, multi-step tasks requiring sustained focus: building web applications, conducting machine learning research, or solving complex programming challenges.

Today’s guest, Beth Barnes, is CEO of METR (Model Evaluation & Threat Research) — the leading organisation measuring these capabilities.

These highlights are from episode #217 of The 80,000 Hours Podcast: Beth Barnes on the most important graph in AI right now — and the 7-month rule that governs its progress, and include:

  • Can we see AI scheming in the chain of thought? (00:00:34)
  • We have to test model honesty even before they're used inside AI companies (00:05:48)
  • It's essential to thoroughly test relevant real-world tasks (00:10:13)
  • Recursively self-improving AI might even be here in two years — which is alarming (00:16:09)
  • Do we need external auditors doing AI safety tests, not just the companies themselves? (00:21:55)
  • A case against safety-focused people working at frontier AI companies (00:29:30)
  • Open-weighting models is often good, and Beth has changed her attitude about it (00:34:57)

These aren't necessarily the most important or even most entertaining parts of the interview — so if you enjoy this, we strongly recommend checking out the full episode!

And if you're finding these highlights episodes valuable, please let us know by emailing [email protected].

Highlights put together by Ben Cordell, Milo McGuire, and Dominic Armstrong

...more
View all episodesView all episodes
Download on the App Store

80k After HoursBy The 80,000 Hours team

  • 5
  • 5
  • 5
  • 5
  • 5

5

15 ratings


More shows like 80k After Hours

View all
Planet Money by NPR

Planet Money

30,771 Listeners

The Right Time with Bomani Jones by Wave Originals

The Right Time with Bomani Jones

12,771 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,425 Listeners

Uncomfortable Conversations with Josh Szeps by Josh Szeps

Uncomfortable Conversations with Josh Szeps

853 Listeners

Hidden Brain by Hidden Brain, Shankar Vedantam

Hidden Brain

43,819 Listeners

Pod Save America by Crooked Media

Pod Save America

87,549 Listeners

The Daily by The New York Times

The Daily

112,398 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,058 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

453 Listeners

People I (Mostly) Admire by Freakonomics Radio + Stitcher

People I (Mostly) Admire

2,099 Listeners

Hard Fork by The New York Times

Hard Fork

5,505 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

129 Listeners

Huberman Lab by Scicomm Media

Huberman Lab

29,378 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,007 Listeners

How I Learned to Love Shrimp by James Özden

How I Learned to Love Shrimp

17 Listeners