


New model, new hype cycle, who dis?
On a Friday afternoon, OpenAI was proud to announce the new model o3-mini and also o3-mini-high which is somewhat less mini, or for some other reasoning tasks you might still want o1 if you want a broader knowledge base, or if you’re a pro user o1-pro, while we wait for o3-not-mini and o3-pro, except o3 can use web search and o1 can’t so it has the better knowledge in that sense, then on a Sunday night they launched Deep Research which is different from Google's Deep Research but you only have a few of those queries so make them count, or maybe you want to use Operator?
Get it? Got it? Good.
Yes, Pliny jailbroke o3-mini on the spot, as he always does.
This post mostly skips over OpenAI's Deep Research (o3-DR? OAI-DR?). I need more time for [...]
---
Outline:
(01:16) Feature Presentation
(04:37) Q&A
(09:14) The Wrong Side of History
(13:29) The System Card
(22:08) The Official Benchmarks
(24:55) The Unofficial Benchmarks
(27:43) Others Report In
(29:47) Some People Need Practical Advice
---
First published:
Source:
Narrated by TYPE III AUDIO.
By LessWrong
