Last Week in AI

#235 - Sonnet 4.6, Deep-thinking tokens, Anthropic vs Pentagon


Listen Later

Our 235th episode with a summary and discussion of last week's big AI news!

Recorded on 02/27/2026

Hosted by Andrey Kurenkov and Jeremie Harris

Feel free to email us your questions and feedback at [email protected] and/or [email protected]

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

In this episode:

  • Model and tool updates highlight Anthropic’s Sonnet 4.6 (1M context; strong ARC-AGI-2 results), Google’s Gemini 3.1 Pro (major ARC-AGI-2 jump and multimodal demos), xAI’s Grok 4.2 beta (multi-agent debate), plus Anthropic’s Claude Code “Remote Control” and Perplexity’s multi-agent “Computer” coordinator.
  • Compute and business moves include Meta’s reported up-to-$100B AMD chip deal with warrant/equity incentives, MatX raising $500M to build specialized transformer chips shipping in 2027, World Labs raising $1B for world-model/3D environment tech, and a new startup raising $100M to simulate/predict human behavior.
  • Infrastructure and geopolitics cover Stargate data-center delays amid OpenAI/Oracle/SoftBank control disputes and cash concerns, and China’s plan to scale 7nm/5nm wafer output despite yield and tooling constraints.
  • Research and safety/policy discuss optimizer gains from masked updates, “deep thinking tokens” as a reasoning-effort signal, LLM attractor-state behaviors in bot-to-bot chats, mechanistic interpretability of counting/line-wrapping, methods to map task difficulty to human time horizons, plus Anthropic–Pentagon contract tensions, Anthropic’s report on distillation attacks (DeepSeek/Moonshot/Minimax), and OpenAI’s report on disrupting malicious use.


A thank you to our current sponsors:

  • Box - visit Box.com/AI to learn more
  • ODSC AI - go to odsc.ai/east and use promo code LWAI for an additional 15% off your pass to ODSC AI East 2026.
  • Factor - head to factormeals.com/lwai50off and use code lwai50off to get 50 percent off and free breakfast for a year


Timestamps:

  • (00:00:10) Intro / Banter
  • (00:01:52) News Preview
  • Tools & Apps
  • (00:03:20) Anthropic releases Sonnet 4.6 | TechCrunch
  • (00:11:24) Google Rolls Out Latest AI Model, Gemini 3.1 Pro - CNET
  • (00:14:54) Elon Musk says Grok 4.20 public beta is now available: Capabilities of AI chatbot offered by xAI - The Times of India
  • (00:18:06) Anthropic just released a mobile version of Claude Code called Remote Control | VentureBeat
  • (00:21:01) Perplexity announces "Computer," an AI agent that assigns work to other AI agents - Ars Technica
  • Applications & Business
  • (00:23:40) Meta strikes up to $100B AMD chip deal as it chases 'personal superintelligence' | TechCrunch
  • (00:27:05) Nvidia challenger AI chip startup MatX raised $500M | TechCrunch
  • (00:31:00) World Labs lands $1B, with $200M from Autodesk, to bring world models into 3D workflows | TechCrunch
  • (00:33:07) Simile Raises $100 Million for AI Aiming to Predict Human Behavior
  • (00:33:52) Stargate AI data centers for OpenAI reportedly delayed by squabbles between partners — sources say OpenAI, Oracle, and SoftBank disagreed on who would have ultimate control of the planned data centers
  • (00:36:43) China to increase leading-edge chip output by 5x in two years, report claims — aims to lift 7nm and 5nm production to 100,000 wafers per month, targeting half a million monthly by 2030
  • Research & Advancements
  • (00:40:33) On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
  • (00:48:03) Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens
  • (00:54:52) models have some pretty funny attractor states
  • (01:01:41) When Models Manipulate Manifolds: The Geometry of a Counting Task
  • (01:05:16) BRIDGE: Predicting Human Task Completion Time From Model Performance
  • (01:12:00) NESSiE: The Necessary Safety Benchmark -- Identifying Errors that should not Exist
  • (01:13:15) The least understood driver of AI progress
  • (01:21:45) The Persona Selection Model: Why AI Assistants might Behave like Humans
  • Policy & Safety
  • (01:25:04) Anthropic CEO Amodei says Pentagon's threats 'do not change our position' on AI
  • (01:33:04) Musk's xAI, Pentagon reach deal to use Grok in classified systems
  • (01:34:17) Detecting and preventing distillation attacks
  • (01:38:36) OpenAI details expanding efforts to disrupt malicious use of AI in new report - SiliconANGLE

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

...more
View all episodesView all episodes
Download on the App Store

Last Week in AIBy Skynet Today

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

306 ratings


More shows like Last Week in AI

View all
The a16z Show by Andreessen Horowitz

The a16z Show

1,096 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

306 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

347 Listeners

Practical AI by Practical AI LLC

Practical AI

210 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

100 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

564 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

512 Listeners

The Artificial Intelligence Show by Paul Roetzer and Mike Kaput

The Artificial Intelligence Show

209 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

144 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

101 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

227 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

676 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

110 Listeners

A Beginner's Guide to AI by Dietmar Fischer

A Beginner's Guide to AI

58 Listeners

The Next Wave - AI and The Future of Technology by Mindstream (Hubspot Media)

The Next Wave - AI and The Future of Technology

56 Listeners