Last Week in AI

#228 - GPT 5.2, Scaling Agents, Weird Generalization


Listen Later

Our 228th episode with a summary and discussion of last week's big AI news!

Recorded on 12/12/2025

Hosted by Andrey Kurenkov and Jeremie Harris

Feel free to email us your questions and feedback at [email protected] and/or [email protected]

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

In this episode:

  • OpenAI's latest model GPT-5.2 demonstrates improved performance and enhanced multi-modal capabilities but comes with increased costs and a different knowledge cutoff date.
  • Disney invests $1 billion in OpenAI to generate Disney character content, creating unique licensing agreements across characters from Marvel, Pixar, and Star Wars franchises.
  • The U.S. government imposes new AI chip export rules involving security reviews, while simultaneously moving to prevent states from independently regulating AI.
  • DeepMind releases a paper outlining the challenges and findings in scaling multi-agent systems, highlighting the complexities of tool coordination and task performance.


Timestamps:

  • (00:00:00) Intro / Banter
  • (00:01:19) News Preview
  • Tools & Apps
  • (00:01:58) GPT-5.2 is OpenAI’s latest move in the agentic AI battle | The Verge
  • (00:08:48) Runway releases its first world model, adds native audio to latest video model | TechCrunch
  • (00:11:51) Google says it will link to more sources in AI Mode | The Verge
  • (00:12:24) ChatGPT can now use Adobe apps to edit your photos and PDFs for free | The Verge
  • (00:13:05) Tencent releases Hunyuan 2.0 with 406B parameters
  • Applications & Business
  • (00:16:15) China set to limit access to Nvidia’s H200 chips despite Trump export approval
  • (00:21:02) Disney investing $1 billion in OpenAI, will allow characters on Sora
  • (00:24:48) Unconventional AI confirms its massive $475M seed round
  • (00:29:06) Slack CEO Denise Dresser to join OpenAI as chief revenue officer | TechCrunch
  • (00:31:18) The state of enterprise AI
  • Projects & Open Source
  • (00:33:49) [2512.10791] The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality
  • (00:36:27) Claude 4.5 Opus' Soul Document
  • Research & Advancements
  • (00:43:49) [2512.08296] Towards a Science of Scaling Agent Systems
  • (00:48:43) Evaluating Gemini Robotics Policies in a Veo World Simulator
  • (00:52:10) Guided Self-Evolving LLMs with Minimal Human Supervision
  • (00:56:08) Martingale Score: An Unsupervised Metric for Bayesian Rationality in LLM Reasoning
  • (01:00:39) [2512.07783] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
  • (01:04:42) Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
  • (01:09:42) Google’s AI unit DeepMind announces UK 'automated research lab'
  • Policy & Safety
  • (01:10:28) Trump Moves to Stop States From Regulating AI With a New Executive Order - The New York Times
  • (01:13:54) [2512.09742] Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
  • (01:17:57) Forecasting AI Time Horizon Under Compute Slowdowns
  • (01:20:46) AI Security Institute focuses on AI measurements and evaluations
  • (01:21:16) Nvidia AI Chips to Undergo Unusual U.S. Security Review Before Export to China
  • (01:22:01) U.S. Authorities Shut Down Major China-Linked AI Tech Smuggling Network
  • Synthetic Media & Art
  • (01:24:01) RSL 1.0 has arrived, allowing publishers to ask AI companies pay to scrape content | The Verge


See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

...more
View all episodesView all episodes
Download on the App Store

Last Week in AIBy Skynet Today

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

306 ratings


More shows like Last Week in AI

View all
The a16z Show by Andreessen Horowitz

The a16z Show

1,093 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

299 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

348 Listeners

Practical AI by Practical AI LLC

Practical AI

210 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

99 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

541 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

513 Listeners

The Artificial Intelligence Show by Paul Roetzer and Mike Kaput

The Artificial Intelligence Show

207 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

139 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

97 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

655 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

108 Listeners

A Beginner's Guide to AI by Dietmar Fischer

A Beginner's Guide to AI

59 Listeners

The Next Wave - AI and The Future of Technology by Mindstream (Hubspot Media)

The Next Wave - AI and The Future of Technology

60 Listeners