AI Explained Official Podcast

o3 breaks (some) records, but AI becomes pay-to-win


Listen Later

A green card, o3 vs Gemini 2.5, 6 Benchmarks and a whole bunch of my thoughts on what on earth is happening in AI, from here to 2030. Plus, how AI is becoming pay-to-win, and why. Crazy times, 14 mins probably wasn’t enough.

https://app.grayswan.ai/ai-explained

AI Insiders ($9!): https://www.patreon.com/AIExplained

Chapters:
00:00 - Introduction
00:33 - FictionLiveBench
01:37 - PHYBench
02:14 - SimpleBench
02:54 - Virology Capabilities Test
03:13 - Mathematics Performance
04:29 - Vision Benchmarks
05:43 - V* and how o3 works
06:44 - Revenue and costs for you
08:54 - Expensive RL and trade-offs 
09:40 - How to spend the OOMs
13:27 - Gray Swan Arena

Green Card: https://techcrunch.com/2025/04/25/an-openai-researcher-who-worked-on-gpt-4-5-had-their-green-card-denied/
PHYBench: https://arxiv.org/pdf/2504.16074Virologytest: https://www.virologytest.ai/
How o3 Vision Works: https://arxiv.org/pdf/2312.14135 https://x.com/sainingxie/status/1912570624523829573
Visual puzzles: https://neulab.github.io/VisualPuzzles/
Fiction Bench: https://x.com/ficlive/status/1912863028141244850
https://geobench.org/
https://simple-bench.com/
AIME 2025: https://openai.com/index/introducing-o3-and-o4-mini/
USAMO: https://x.com/mbalunovic/status/1914398518896193747
NaturalBench: https://linzhiqiu.github.io/papers/naturalbench/
Where’s Waldo: https://uk.pinterest.com/pin/492792384225896298/
IMO and AlphaProof:https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/
Crazy Revenue: https://www.theinformation.com/articles/openai-forecasts-revenue-topping-125-billion-2029-agents-new-products-gain?rc=sy0ihq
Number of Users: https://www.theinformation.com/briefings/googles-gemini-user-numbers-revealed-court?rc=sy0ihq
Subscriptions pay to win: https://www.forbes.com/sites/paulmonckton/2025/04/23/google-leak-reveals-new-gemini-ai-subscription-levels/
GPU Trade-offs: https://x.com/sama/status/1915098951067554030
RL Scale-up Amodei: https://www.darioamodei.com/post/on-deepseek-and-export-controls
Log-linear Returns: https://x.com/bobmcgrewai/status/1895228291981943265
2030 Scaling: https://epoch.ai/blog/can-ai-scaling-continue-through-2030
Model Size: https://x.com/slow_developer/status/1874554473256997201
Adam on AGI: https://x.com/TheRealAdamG/status/1913998366632968381
Papers on Patreon: https://arxiv.org/pdf/2502.01839
https://arxiv.org/pdf/2504.13837
Chollet Quote: https://x.com/fchollet/status/1912934762580447447
OpenSim: https://opensim.stanford.edu/


Non-hype Newsletter: https://signaltonoise.beehiiv.com/

...more
View all episodesView all episodes
Download on the App Store

AI Explained Official PodcastBy Philip - Host of AI Explained YT

  • 3.1
  • 3.1
  • 3.1
  • 3.1
  • 3.1

3.1

9 ratings


More shows like AI Explained Official Podcast

View all
NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

348 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

201 Listeners

Last Week in AI by Skynet Today

Last Week in AI

310 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

98 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

529 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

512 Listeners

Hard Fork by The New York Times

Hard Fork

5,548 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

142 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

98 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

226 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

638 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

106 Listeners

Money Stuff: The Podcast by Bloomberg

Money Stuff: The Podcast

403 Listeners

AI Explored by Michael Stelzner, Social Media Examiner—AI marketing

AI Explored

99 Listeners

How I AI by Claire Vo

How I AI

151 Listeners