Intellectually Curious

Proofs on the Whiteboard: GPT-5 and the First Proof Challenge



We unpack OpenAI’s February 2026 First Proof challenge, in which GPT-5 and GPT-5.2 used an internal reasoning process (more like a tree search than a word predictor) to tackle 10 research-grade problems in topology and physics. Working through a collaborative generate-solve-refine workflow with human supervision, the models solved five problems (4, 5, 6, 9, 10); the solution to problem 2 was retracted after peer review. We dive into the one-sided matrix barrier argument in problem 6 and discuss what this means for AI as a true reasoning partner in science and industry.


Note: This podcast was AI-generated, and sometimes AI can make mistakes. Please double-check any critical information.

Sponsored by Embersilk LLC


By Mike Breault