


Here we go again, only a few weeks after GPT-5.1 and a few more weeks after 5.0.
There weren’t major safety concerns with GPT-5.2, so I’ll start with capabilities, and only cover safety briefly starting with ‘Model Card and Safety Training’ near the end.
Table of Contents
The Bottom Line
ChatGPT-5.2 is a frontier model for those who need a frontier model.
It is not the step change that is implied by its headline benchmarks. It is rather slow.
Reaction was remarkably muted. People have new model fatigue. So we know less about it than we would have known about prior models after this length of time.
If you’re coding, compare it to Claude Opus 4.5 and choose what works best for you.
If you’re doing intellectually [...]
---
Outline:
(00:29) The Bottom Line
(01:58) Introducing GPT-5.2
(03:49) Official Benchmarks
(05:54) GDPVal
(08:14) Unofficial Benchmarks
(11:11) Official Hype
(12:36) Public Reactions
(12:59) Positive Reactions
(19:09) Personality Clash
(24:30) Vibing the Code
(27:25) Negative Reactions
(30:37) But Thou Must (Follow The System Prompt)
(33:09) Slow
(34:16) Model Card And Safety Training
(36:23) Deception
(38:10) Preparedness Framework
(40:10) Rush Job
(41:29) Frontier Or Bust
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
By zvi
