Last Week in AI

#238 - GPT 5.4 mini, OpenAI Pivot, Mamba 3, Attention Residuals



Our 238th episode with a summary and discussion of last week's big AI news!

Recorded on 03/18/2026

Hosted by Andrey Kurenkov and Jeremie Harris

Feel free to email us your questions and feedback at [email protected] and/or [email protected]

Read our text newsletter and comment on the podcast at https://lastweekin.ai/

In this episode:

* OpenAI released GPT-5.4 mini and nano with 400k-token context windows, higher per-token prices but claimed token-efficiency gains in Codex; nano is API-only and pitched for high-volume classification/data extraction despite a major price increase.

* Mistral open-sourced the Small 4 model family (MoE, 119B total/6B active) combining reasoning, multimodal, and coding-agent capabilities, and announced Forge to help businesses train or post-train custom models.

* Agent “operating system” competition intensified with Meta’s acquired Manus launching a local Mac agent, Nvidia announcing NeMo/“Open Shell” sandboxed agent runtime, and Nvidia also unveiling DLSS 5 plus major hardware forecasts including Groq LPU integration.

* Business and safety updates included OpenAI shifting focus toward productivity/enterprise amid competition, Microsoft reorganizing Copilot and frontier-model efforts, Meta delaying its next model, China-linked ByteDance deploying large Nvidia clusters abroad, and new safety work on steganography, chain-of-thought faithfulness, fine-tuning defenses, cyber-attack evals, and constitution/spec compliance.

A thank you to our current sponsors:

  • Box - visit Box.com/AI to learn more
  • ODSC AI - go to odsc.ai/east and use promo code LWAI for an additional 15% off your pass to ODSC AI East 2026.
  • Factor - head to factormeals.com/lwai50off and use code lwai50off to get 50 percent off and free breakfast for a year


Timestamps:

  • (00:00:10) Intro / Banter
  • (00:01:56) News Preview
  • Tools & Apps
  • (00:02:39) OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier
  • (00:08:04) Mistral's new Small 4 model punches above its weight with 128 expert modules
  • (00:14:03) Meta's Manus launches 'My Computer' to turn your Mac into an AI agent - 9to5Mac
  • (00:17:57) NVIDIA Announces NemoClaw for the OpenClaw Community | NVIDIA Newsroom + Nvidia boosts knowledge work with Open Agent Development Platform
  • (00:24:09) DLSS 5 looks like a real-time generative AI filter for video games | The Verge
  • (00:26:36) OpenAI to Launch ChatGPT 'Adult Mode' Despite Warnings From Its Own Advisers - CNET
  • Applications & Business
  • (00:33:46) OpenAI Reportedly Pivoting to a Focus on Business and Productivity Only
  • (00:41:25) Nvidia GTC 2026: CEO Jensen Huang sees $1 trillion in orders for Blackwell and Vera Rubin through ’27
  • (00:45:44) Mistral launches Forge to help enterprises build their own AI models
  • (00:54:17) China's ByteDance gets access to top Nvidia AI chips, WSJ reports
  • (00:57:57) Meta Delays Rollout of New A.I. Model After Performance Concerns
  • (01:02:50) Microsoft Shakes Up AI Division As Copilot Falls Behind Google and OpenAI
  • Policy & Safety
  • (01:07:26) A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring
  • (01:13:09) Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
  • (01:18:29) In-Training Defenses against Emergent Misalignment in Language Models
  • (01:23:07) How do frontier AI agents perform in multi-step cyber-attack scenarios?
  • (01:25:20) Eval awareness in Claude Opus 4.6’s BrowseComp performance
  • (01:29:49) Introducing Bloom: an open source tool for automated behavioral evaluations
  • (01:32:26) How well do models follow their constitutions?
  • (01:37:11) Nvidia’s H200 License Stirs Security Concern Among Top Democrats
  • Research & Advancements
  • (01:40:050) [2603.15031] Attention Residuals
  • (01:47:11) Mamba-3: Improved Sequence Modeling using State Space Principles


See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.


Last Week in AI, by Skynet Today

4.6

306 ratings

