Artificial intelligence is advancing rapidly, but our ability to measure what these systems can actually do—and the risks they may pose—has lagged behind. Headline benchmarks and viral demos offer snapshots of a system's performance, but they say little about how AI behaves in complex real-world settings or how much autonomy models can sustain over time. As these systems take on more consequential roles, the challenge is not just building more powerful models, but developing credible ways to evaluate their capabilities and limits.
Chris Painter, president of Model Evaluation and Threat Research (METR), joins Oren to discuss how researchers are building new frameworks to assess AI systems and what those efforts reveal about the trajectory of machine intelligence. They explore “time horizon” as a measure of autonomy, the difficulty of evaluating alignment and sabotage risks, and the constraints posed by compute and organizational bottlenecks. They also consider what it will look like when AI systems begin contributing even more to their own development and their capabilities outpace our ability to measure them.
By American Compass · 4.5 (61 ratings)