LessWrong posts by zvi

“GPT-5.5: Capabilities and Reactions” by Zvi


Listen Later

The system card for GPT-5.5 mostly told us what we expected. See this thread from Drake Thomas for some comparisons to Anthropic's model card for Opus 4.7.

Now we move on to asking what it means in practice, and in what situations GPT-5.5 should become our new weapon of choice.

My answer is for some purposes yes, and for others no, but it is now competitive. GPT-5.5 is like GPT-5.4, only more so, and with improved capabilities in particular on raw intelligence and for well-specified coding and agent tasks, including computer use.

This is the first time since Claude Opus 4.5 came out, so in about four months, that I’ve considered a non-Anthropic model a competitive choice outside of some narrow tasks like web search. GPT-5.5 is not perfect, nor is it the best at everything, but basically everyone thinks this is a solid upgrade. Highly positive overall feedback.

My effective usage is now split between the two, depending on the nature of the task. If it's something that can be well-specified and all I want is the right answer, my instinct is I go with GPT-5.5. If I’m not sure what exactly I want [...]

---

Outline:

(02:20) The Official Pitch

(07:49) Our Price Cheap

(08:29) Official Benchmarks

(11:58) SemiAnalysis Doublecheck

(12:38) Other Peoples Benchmarks

(16:00) Vend That Bench

(19:06) Planning Is Essential

(20:43) Choose Your Fighter

(22:44) Cyber Lack Of Security

(23:12) You Get What You Give

(24:20) True Story

(25:33) Ethan Mollick Thinks GPT-5.5 Is A Big Deal

(26:04) SemiAnalysis Loves GPT-5.5 Especially In Codex

(28:27) Choose Your Fighter

(29:13) Positive Reactions

(36:59) Lazy and Literal

(38:09) Goblins, Gremlins and Trolls, Oh My

(40:02) Other Reactions

(40:34) Claude Ambition

(41:00) Other Notes

---

First published:

April 28th, 2026

Source:

https://www.lesswrong.com/posts/5ytcFayxqZsXN8rNw/gpt-5-5-capabilities-and-reactions

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more
View all episodesView all episodes
Download on the App Store

LessWrong posts by zviBy zvi

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like LessWrong posts by zvi

View all
Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,278 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,448 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,107 Listeners

Future of Life Institute Podcast by Future of Life Institute

Future of Life Institute Podcast

108 Listeners

ChinaTalk by Jordan Schneider

ChinaTalk

288 Listeners

Politix by Politix

Politix

89 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

564 Listeners

Hard Fork by The New York Times

Hard Fork

5,554 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

138 Listeners

LessWrong (Curated & Popular) by LessWrong

LessWrong (Curated & Popular)

12 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

146 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

149 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

460 Listeners

LessWrong (30+ Karma) by LessWrong

LessWrong (30+ Karma)

0 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

141 Listeners