December 15, 2025

“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

Listen Later

43 minutes

Here we go again, only a few weeks after GPT-5.1 and a few more weeks after 5.0.

There weren’t major safety concerns with GPT-5.2, so I’ll start with capabilities, and only cover safety briefly starting with ‘Model Card and Safety Training’ near the end.

Table of Contents

The Bottom Line.

Introducing GPT-5.2.

Official Benchmarks.

GDPVal.

Unofficial Benchmarks.

Official Hype.

Public Reactions.

Positive Reactions.

Personality Clash.

Vibing the Code.

Negative Reactions.

But Thou Must (Follow The System Prompt).

Slow.

Model Card And Safety Training.

Deception.

Preparedness Framework.

Rush Job.

Frontier Or Bust.

The Bottom Line

ChatGPT-5.2 is a frontier model for those who need a frontier model.

It is not the step change that is implied by its headline benchmarks. It is rather slow.

Reaction was remarkably muted. People have new model fatigue. So we know less about it than we would have known about prior models after this length of time.

If you’re coding, compare it to Claude Opus 4.5 and choose what works best for you.

If you’re doing intellectually [...]

---

Outline:

(00:29) The Bottom Line

(01:58) Introducing GPT-5.2

(03:49) Official Benchmarks

(05:54) GDPVal

(08:14) Unofficial Benchmarks

(11:11) Official Hype

(12:36) Public Reactions

(12:59) Positive Reactions

(19:09) Personality Clash

(24:30) Vibing the Code

(27:25) Negative Reactions

(30:37) But Thou Must (Follow The System Prompt)

(33:09) Slow

(34:16) Model Card And Safety Training

(36:23) Deception

(38:10) Preparedness Framework

(40:10) Rush Job

(41:29) Frontier Or Bust

---

First published:

December 15th, 2025

Source:

https://www.lesswrong.com/posts/Do4eWro8E552isGi5/gpt-5-2-is-frontier-only-for-the-frontier

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

LessWrong posts by zvi

By zvi

5

22 ratings

December 15, 2025

“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

Listen Later

43 minutes

Here we go again, only a few weeks after GPT-5.1 and a few more weeks after 5.0.

There weren’t major safety concerns with GPT-5.2, so I’ll start with capabilities, and only cover safety briefly starting with ‘Model Card and Safety Training’ near the end.

Table of Contents

The Bottom Line.

Introducing GPT-5.2.

Official Benchmarks.

GDPVal.

Unofficial Benchmarks.

Official Hype.

Public Reactions.

Positive Reactions.

Personality Clash.

Vibing the Code.

Negative Reactions.

But Thou Must (Follow The System Prompt).

Slow.

Model Card And Safety Training.

Deception.

Preparedness Framework.

Rush Job.

Frontier Or Bust.

The Bottom Line

ChatGPT-5.2 is a frontier model for those who need a frontier model.

It is not the step change that is implied by its headline benchmarks. It is rather slow.

Reaction was remarkably muted. People have new model fatigue. So we know less about it than we would have known about prior models after this length of time.

If you’re coding, compare it to Claude Opus 4.5 and choose what works best for you.

If you’re doing intellectually [...]

---

Outline:

(00:29) The Bottom Line

(01:58) Introducing GPT-5.2

(03:49) Official Benchmarks

(05:54) GDPVal

(08:14) Unofficial Benchmarks

(11:11) Official Hype

(12:36) Public Reactions

(12:59) Positive Reactions

(19:09) Personality Clash

(24:30) Vibing the Code

(27:25) Negative Reactions

(30:37) But Thou Must (Follow The System Prompt)

(33:09) Slow

(34:16) Model Card And Safety Training

(36:23) Deception

(38:10) Preparedness Framework

(40:10) Rush Job

(41:29) Frontier Or Bust

---

First published:

December 15th, 2025

Source:

https://www.lesswrong.com/posts/Do4eWro8E552isGi5/gpt-5-2-is-frontier-only-for-the-frontier

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more

More shows like LessWrong posts by zvi

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,276 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,448 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,106 Listeners

Future of Life Institute Podcast by Future of Life Institute

Future of Life Institute Podcast

108 Listeners

ChinaTalk by Jordan Schneider

ChinaTalk

289 Listeners

Politix by Politix

Politix

89 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

563 Listeners

Hard Fork by The New York Times

Hard Fork

5,549 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

138 Listeners

LessWrong (Curated & Popular) by LessWrong

LessWrong (Curated & Popular)

12 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

146 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

149 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

461 Listeners

LessWrong (30+ Karma) by LessWrong

LessWrong (30+ Karma)

0 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

141 Listeners