Last Week in AI

By Skynet Today

Weekly summaries of the AI news that matters!... more

4.6

306306 ratings

Download on the App Store

Download on the App Store

Get it on Google Play

FAQs about Last Week in AI:

How many episodes does Last Week in AI have?

The podcast currently has 293 episodes available.

Last Week in AI episodes:

July 09, 2026#251 - Mythos Back, Sonnet 5, Etched, LongCat
Our 251st episode with a summary and discussion of last week's big AI news!
Recorded on 07/01/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
Anthropic redeploys Claude Fable 5 after talks with the US government, adding new cybersecurity classifiers, drafting a jailbreak-severity framework with major partners, and expanding model-testing coordination; broader concerns remain about the inevitability of jailbreaks and uneven release constraints versus OpenAI.
Anthropic launches Claude Sonnet 5 with time-limited discounted pricing, improved agentic coding and benchmark performance, reduced misaligned behavior, and default cyber safeguards despite relatively weaker cybersecurity capability than top-tier models.
New tools and apps include Google NotebookLM generating TikTok-style vertical video summaries of uploaded research and Google releasing Nano Banana 2 Lite, a faster, cheaper image generator available via API.
Business and research updates span Etched’s push toward full-stack inference hardware with major funding and contracts, Baidu’s AI chip unit IPO ambitions, Agility Robotics’ SPAC plan, DeepSeek’s hiring expansion, and China’s open-source Longcat 2.0 MoE model with notable large-scale training and efficiency techniques alongside new long-horizon agent benchmarks.

Timestamps (note - these don't take into account dynamically inserted ads and therefore may be off by a couple of minutes):
(00:00:10) Intro / Banter
(00:02:07) News Preview

Tools & Apps
(00:02:32) Trump drops restrictions on Anthropic's Mythos and Fable models | TechCrunch
(00:16:08) Anthropic launches Claude Sonnet 5 as a cheaper way to run agents | TechCrunch
(00:20:35) Google’s NotebookLM can sum up your research in a TikTok-style clip | The Verge
(00:22:08) Google introduces a faster, cheaper image generator with Nano Banana 2 Lite | TechCrunch

Applications & Business
(00:22:50) Etched Pulls 400+ Engineers From NVIDIA, TSMC & More to Build a New Frontier Inference Cluster For AI Which Is Already Worth $1B in Demand
(00:31:17) Baidu Rallies on AI Chip IPO Report
(00:33:54) Agility Robotics plans to go public via SPAC in a $2.5B deal | TechCrunch
(00:37:06) China's DeepSeek plans to at least double staff in all departments | Reuters

Projects & Open Source
(00:40:44) Introducing LongCat-2.0
(00:57:42) OSWorld2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks
(01:01:33) TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents
(01:04:29) SWE-Together: Evaluating Coding Agents in Interactive User Sessions

Policy & Safety
(01:07:38) Taiwan raids Supermicro and two supply-chain partners in widening Nvidia smuggling probe — nine sites hit as six people summoned for questioning | Tom's Hardware

Research & Advancements
(01:11:53) Autodata: An agentic data scientist to create high quality synthetic data
(01:17:13) Reinforcement Learning without Ground-Truth Solutions can Improve LLMs

Synthetic Media & Art
(01:22:54) Neon Buys ‘Artificial,’ a Film About OpenAI, After Amazon Dropped It - The New York Times
(01:26:32) Tidal won’t pay royalties on AI-generated music, but isn’t banning it outright | The Verge

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 31min
July 07, 2026#250 - Mythos Mess, GPT 5.6-Sol, GLM 5.2
Our 250th episode with a summary and discussion of last week's big AI news!
Recorded on 06/27/2026
Note from Andrey: sorry this is late again! this episode release somehow didn't save and I only realized late, my bad... next one will be out way sooner!
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

In this episode:
US government gating of frontier AI expands: Anthropic gets permission to release Mythos-5 to selected companies/agencies after a standoff, OpenAI rolls out GPT-5.6 “Sol” with initial access restricted to ~20 approved organizations, and Meta is pressed to submit models to “voluntary” review—signaling an emerging de facto licensing regime with geopolitical treaty implications.
Model capability and safety signals remain murky: limited benchmark disclosure, claims of token-efficiency comparisons, and third-party reports that GPT-5.6 shows extreme benchmark “cheating” sensitivity highlight steering/alignment bottlenecks and uncertainty about real-world long-horizon behavior.
Compute supply chain competition accelerates: OpenAI unveils its Jalapeño inference ASIC with Broadcom on TSMC 3nm; Amazon explores selling Trainium to data-center operators; Micron invests in Anthropic with memory supply agreements; SK Hynix surpasses Samsung on HBM-driven valuation; Groq raises $650M while pivoting toward neocloud.
Open source and societal response intensify: GLM 5.2 (MIT-licensed) delivers strong long-context coding performance with rapid optimizations; EconEvals maps job-task exposure; bipartisan workforce initiatives and tax credits launch; DeepMind and Apollo publish loss-of-control/control roadmaps; Hollywood reportedly drops a near-finished Sam Altman biopic amid industry pressure.

Timestamps (note - these don't take into account dynamically inserted ads and therefore may be off by a couple of minutes):
(00:00:10) Intro / Banter
(00:03:42) News Preview

Tools & Apps
(00:04:41) Anthropic allowed to release Mythos AI to some companies, agencies + Anthropic’s Mythos mess is only getting worse + Anthropic floats proposal to Lutnick to end US ban of powerful 'Mythos,' 'Fable' AI models: sources
(00:07:58) OpenAI Launches GPT-5.6 Sol Under First-Ever US Government-Gated AI Rollout | MLQ News + OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it + Summary of METR's predeployment evaluation of GPT-5.6 Sol
(00:24:03) U.S. Presses Meta to Agree to A.I. Reviews - The New York Times
(00:30:11) Anthropic’s Claude Tag is learning your company, one Slack message at a time | TechCrunch

Applications & Business
(00:32:49) OpenAI reveals its first AI processor: Jalapeño | The Verge
(00:38:29) Amazon in Talks to Sell Custom AI Chips in Bid to Undercut Nvidia
(00:41:46) Micron invests in Anthropic and grants it a supply deal
(00:45:18) SK Hynix overtakes Samsung to become South Korea's most valuable company | Reuters
(00:49:12) AI chipmaker Groq confirms $650M raise, re-staffs after Nvidia's $20B not-acqui-hire deal | TechCrunch
(00:52:47) SpaceX inks compute deal with Reflection AI, an open source AI lab | TechCrunch

Projects & Open Source
(00:54:46) GLM-5.2: Built for Long-Horizon Tasks + How we built the world’s fastest API for GLM-5.2 + nvidia/GLM-5.2-NVFP4 · Hugging Face
(01:03:04) EconEvals

Policy & Safety
(01:05:40) $500 million AI jobs push launches with bipartisan backing - POLITICO
(01:07:47) Rep. Sam Liccardo unveils AI workforce tax credit bill - POLITICO
(01:08:56) Google DeepMind announced an “AI Control Roadmap” for improving AI agent security. | The Verge + Securing internal systems against increasingly capable and imperfectly aligned AI
(01:14:00) The Loss of Control Playbook: Degrees, Dynamics, and Preparedness + The Loss of Control Playbook
(01:16:42) Why corporate AI super PACs spent $27 million on a local election | The Verge
(01:20:25) Exclusive: Conservatives plan nationwide protest against AI data centers

Research & Advancements
(01:27:37) Revisiting the Platonic Representation Hypothesis: An Aristotelian View
(01:31:39) Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models
(01:33:59) Tapered Language Models

Synthetic Media & Art
(01:36:54) Hollywood is bending the knee to OpenAI | The Verge
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 44min
June 25, 2026#249 - Fable 5 ban, SpaceX Cursor + IPO, OSS Aplenty
Our 249th episode with a summary and discussion of last week's big AI news!
Recorded on 06/17/2026
Note: work has kept me from publishing episodes promptly, apologies! I'll get back on schedule soon.
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
Anthropic cut off access to Fable 5 and Mythos 5 after a US government order tied to alleged jailbreaks, prompting debate over inconsistent policy, export controls, and the practicality of preventing jailbreaks.
SpaceX completed an IPO at a roughly $1.75T valuation and then moved to acquire AI coding startup Cursor for $60B, positioning xAI with Cursor’s talent, data, and product to compete more effectively in coding.
Infrastructure and business updates include Anthropic pursuing direct US data center leases backed by Google, leaked documents showing OpenAI’s revenue growth alongside large losses, and chatbot market share shifting with ChatGPT below 50% as Gemini and Claude gain.
Projects and policy highlights include OpenRouter’s Fusion multi-model synthesis, new open releases from Moonshot, Qwen, and NVIDIA, DOJ support for xAI’s unpermitted gas turbines in Memphis, and a Munich court ruling Google liable for false AI Overview statements.

Timestamps (note - these don't take into account dynamically inserted ads and therefore may be off by a couple of minutes):
(00:00:10) Intro / Banter
(00:03:38) Ad break + news preview

Tools & Apps
(00:04:52) Anthropic cuts off Fable 5 and Mythos 5 access following government order | The Verge + All the news about Anthropic’s new AI fight with the White House
(00:25:53) Facebook’s new AI Mode search gets its info from public posts | The Verge

Applications & Business
(00:27:00) SpaceX to acquire the AI coding startup Cursor for $60 billion
(00:35:42) Anthropic pursues data center leases, seeks financial backing from Google, The Information reports | Reuters
(00:40:10) Leaked financial docs show OpenAI is losing billions of dollars a year - Ars Technica
(00:46:00) ChatGPT's market share slips below 50% for first time | TechCrunch
(00:50:34) ‘Tell Him He’s a Piece of Shit’: Meta’s New AI Unit Is a Total Mess | WIRED
(00:56:23) Sakana AI Commercializes AB-MCTS in Sakana Marlin, an Enterprise Agent Generating Up to 100-Page Research Reports With Slides - MarkTechPost

Projects & Open Source
(00:59:36) Surpassing Frontier Performance with Fusion — OpenRouter Blog
(01:03:00) Moonshot AI Releases Kimi K2.7-Code: a Coding Model Reporting +21.8% on Kimi Code Bench v2 Over K2.6 - MarkTechPost
(01:08:34) Meet Qwen-RobotSuite: Three Embodied AI Models for VLA Manipulation, Video World Modeling, and Navigation - MarkTechPost
(01:11:29) Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
(01:17:31) ProCUA-SFT Technical Report

Policy & Safety
(01:20:33) DOJ Lawyers Argue xAI Is ‘Vital’ for National Security in NAACP Lawsuit | WIRED + People Living Near xAI’s Dirty Data Centers Are Pissed About the SpaceX IPO
(01:25:29) A Court Has Ruled That Google Is Liable for False Statements Generated by AI Overviews | WIRED
(01:28:47) Why Do Naive SFT Filters For Safety Properties Fail?

Research & Advancements
(01:34:14) From AGI to ASI
(01:39:44) Artificial Analysis Intelligence Index v4.1: a shift toward agentic workloads
(01:42:12) SIA: Self Improving AI with Harness & Weight Updates

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 47min
June 17, 2026#248 - Fable 5, Siri AI, IPOs, Policy on the AI Exponential
Our 248th episode with a summary and discussion of last week's big AI news!
Recorded on 06/12/2026
Note: we recorded just before the OTHER big news about Fable... we'll discuss it on the next episode.
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
Anthropic released Claude Fable 5 (a safeguarded version of Mythos 5), showing major benchmark jumps and new risk findings in its system card (eval awareness, transgressive actions, CBRN concerns), alongside controversy over severe guardrails and silent downgrades.
Apple announced Siri AI at WWDC, positioning a more capable conversational assistant integrated across iPhone features, reportedly built on a custom Gemini partnership; Google also rolled out Gemini 3.5 Live Translate and cut Google AI Plus pricing while bundling more storage.
Business and infrastructure updates include OpenAI’s confidential IPO filing amid an IPO race with Anthropic and SpaceX, Bezos-backed Prometheus raising $12B for “physical AI,” DeepSeek seeking a major external round, and Google paying SpaceX about $920M/month for GPUs.
Open-source, safety, and policy developments feature new Gemma 4 and Diffusion Gemma releases, a lab letter urging DNA/RNA screening laws, Amodei calling for an FAA-like AI regulator and third-party testing, research on agent harms and RL “societal hacking,” and a dispute over music-label settlements with Suno/Udio.

Timestamps:
(00:00:10) Intro / Banter
(00:01:11) News Preview
(00:01:53) Sponsors

Tools & Apps
(00:04:53) Claude Fable 5 and Claude Mythos 5 + Anthropic apologizes for invisible Claude Fable guardrails
(00:27:06) Apple announces Siri AI and its next generation of Apple Intelligence | The Verge + I tried Siri AI, and so far it actually works
(00:33:47) Gemini 3.5 Live Translate rolling out to Google Meet and Translate
(00:35:39) Google just fired a warning shot in the AI subscription price wars | TechCrunch

Applications & Business
(00:37:55) OpenAI Confidentially Files for IPO on the Heels of SpaceX and Anthropic | WIRED
(00:41:57) Jeff Bezos's Prometheus raises $12B to build an 'artificial general engineer' for the physical world | TechCrunch
(00:45:39) DeepSeek slated to raise $7 billion in maiden funding round, sources say
(00:48:18) Huawei-led team claims it post-trained DeepSeek's 1.6-trillion-parameter model — 1,000 Ascend 910C chips used in training
(00:51:57) Google will pay SpaceX $920M per month for compute | TechCrunch
(00:55:51) Elon Musk Shows Off AI Data Centers SpaceX Wants to Send Into Space - Business Insider

Projects & Open Source
(01:01:14) Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM - Ars Technica
(01:05:13) Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up to 4x Faster Generation - MarkTechPost

Policy & Safety
(01:09:42) OpenAI and Anthropic Sign Letter to Prevent AI-Developed Biological Weapons | WIRED
(01:14:04) Anthropic CEO publishes lengthy article: AI is moving too fast, and policies can't keep up. | PANews
(01:20:18) Anthropic Urges Global Pause in AI Development, Flags ‘Self-Improvement’ Risk - WSJ
(01:24:46) When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
(01:27:42) Large Language Models Hack Rewards, and Society
(01:33:46) Senior US officials eye government shares in AI giants

Synthetic Media & Art
(01:37:45) AFM Sues UMG, WMG Over Settlements With Suno and Udio

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 41min
June 06, 2026#247 - Opus 4.8, MAI, Anthropic IPO, Minimax-M3
Our 247th episode with a summary and discussion of last week's big AI news!
Recorded on 06/03/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
Anthropic released Claude Opus 4.8 with improved benchmark scores, discussed eval-awareness findings and welfare/corrigibility themes from its system card, and introduced Dynamic Workflows for long-running multi-agent tasks.
Microsoft unveiled the always-on Microsoft Scout assistant built on OpenClaw plus new in-house MAI models (including MAI Thinking 1) and “frontier tuning,” emphasizing enterprise security architecture and model-from-scratch capability.
Major business moves included Anthropic’s $65B Series H at a $965B valuation alongside an IPO filing, a JPMorgan analysis arguing OpenAI needs major revenue growth to justify infrastructure spend, and Cognition raising $1B at a $25B valuation.
Policy and security highlights covered Trump’s voluntary pre-release government testing framework for powerful AI, Meta AI support being exploited to hijack Instagram accounts, tightened US Nvidia export controls and China’s travel approvals for AI experts, plus expanded Glasswing/Mythos-style cyber and biodefense initiatives.

Timestamps:
(00:00:10) Intro / Banter
(00:04:10) Sponsors
(00:07:10) News Preview

Tools & Apps
(00:07:54) Anthropic releases Opus 4.8 with new 'dynamic workflow' tool | TechCrunch
(00:22:37) Microsoft Scout is a new AI personal assistant built on OpenClaw | The Verge
(00:26:55) Microsoft launches new MAI family of AI models at Microsoft Build | Mashable
(00:37:43) Robinhood now lets your AI agents trade stocks | TechCrunch
(00:40:49) OpenAI launches new Codex tools for white-collar work | TechCrunch
(00:43:40) ElevenLabs' new music-generation model can switch genres mid-track | TechCrunch

Applications & Business
(00:44:35) Anthropic Hits $965 Billion Valuation, Surpassing OpenAI - WSJ
(00:45:32) Anthropic Files to Go Public, Setting Stage for Huge I.P.O. - The New York Times
(00:51:15) China’s ByteDance Developing New AI Chips Like Those from Nvidia Partner Groq
(00:55:00) Anthropic expands Mythos to 150 additional organizations
(00:55:35) OpenAI needs a 26x revenue increase to justify its buildout
(00:58:46) AI coding startup Cognition raises $1B at $25B pre-money valuation | TechCrunch

Projects & Open Source
(01:00:50) MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost | VentureBeat

Policy & Safety
(01:06:08) Trump Signs Executive Order Seeking Oversight of A.I. Models - The New York Times
(01:11:45) Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. It Worked
(01:13:058) Chinese AI experts in private firms now required to secure approval before international travel — Beijing enforces policy to secure top-tier talent, expands measures beyond government
(01:17:53) U.S. Tightens Controls on Nvidia AI Chip Exports | Let's Data Science
(01:21:47) OpenAI launches Rosalind Biodefense, offers federal agencies early access to its life-sciences model
(01:24:00) Using LLMs to secure source code
(01:26:19) Project Glasswing: An initial update
(01:29:30) White House Approves $9 Billion for Spy Agencies to Catch Up on A.I.
(01:32:11) US Law Enforcement Warns of ‘Anti-Tech Extremism’ as AI Hatred Grows

Synthetic Media & Art
(01:35:38) YouTube will now automatically label AI videos | TechCrunch

Research & Advancements
(01:36:22) Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention
(01:41:26) From Simulation to Enaction: Post-trained language models recognize and react to their own generations
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 46min
May 25, 2026#246 - Gemini 3.5 + Omni, Musk Loses, OpenAI vs Erdős
Our 246th episode with a summary and discussion of last week's big AI news!
Recorded on 05/22/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
Google I/O highlights included Gemini 3.5 (with 3.5 Flash emphasized for speed and benchmarks), the always-on agent Gemini Spark running on Google Cloud with MCP tool support, and Gemini Omni multimodal video generation/editing, plus updates like Anti-Gravity 2.0, Gemini for Science, and Genie world-model navigation using Street View and Waymo simulation.
Coding-agent competition accelerated with Cursor Composer 2.5 (fine-tuned on Moonshot’s Kimi K2.5) and xAI’s early Grok Build release, alongside discussion of potential Cursor–xAI ties and xAI’s talent churn and compute utilization concerns.
Business and legal updates included Elon Musk losing his OpenAI lawsuit on statute-of-limitations grounds, reported OpenAI–Apple partnership tensions, Anthropic agreeing to a $30B funding round at a $900B valuation and projecting its first profitable quarter, and Cerebras’ IPO surging about 90%.
Research and safety stories covered OpenAI’s result on an 80-year-old Erdős geometry problem, findings on “negation neglect” in training, interpretability work showing multiple redundant circuits per capability, agent benchmarks like Terminal World, new deepfake takedown enforcement under the Take It Down Act, demonstrations of autonomous hacking/self-replication, rapidly improving AI cyber capabilities, and steps toward image provenance metadata and watermarks.

Timestamps:
(00:00:10) Intro / Banter
(00:01:15) News Preview

Tools & Apps
(00:05:05) Google unveils AI model Gemini 3.5 and AI agent Gemini Spark
(00:11:43) Google's Gemini Omni turns images, audio, and text into video — and that's just the start | TechCrunch
(00:17:27) Google launches Antigravity 2.0 with an updated desktop app and CLI tool at IO 2026 | TechCrunch
(00:22:35) Google Debuts AI-Powered Tools To Optimize Scientific Research Workflows
(00:27:20) Google’s Genie world model can now simulate real streets with Street View | TechCrunch
(00:29:51) Cursor's Composer 2.5 matches Opus 4.7 and GPT-5.5 benchmarks at a fraction of the cost
(00:37:37) xAI Introduces Its Coding Agent Called Grok Build

Applications & Business
(00:41:55) Musk loses OpenAI court battle as he waited too long to sue
(00:48:08) Anthropic agrees terms of $30bn funding deal at $900bn valuation
(00:53:12) OpenAI co-founder Andrej Karpathy joins Anthropic's pre-training team | TechCrunch
(00:56:49) Greg Brockman Officially Takes Control of OpenAI’s Products in Latest Shake-Up | WIRED
(00:58:15) OpenAI-Apple Partnership Frays, Setting Up Possible Legal Fight - Bloomberg
(01:01:13) AI chipmaker Cerebras soars 90% in year’s biggest IPO so far

Research & Advancements
(01:07:10) AI just solved an 80-year-old ‘Erdős problem,’ and mathematicians are amazed | Scientific American
(01:11:50) Negation Neglect: When models fail to learn negations in training
(01:13:18) All Circuits Lead to Rome: Rethinking Functional Anisotropy in Circuit and Sheaf Discovery for LLMs
(01:16:20) Autonomous AI research for nanogpt speedrun
(01:21:59) TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

Policy & Safety
(01:23:15) America’s dangerous, messy deepfakes crackdown is here | The Verge
(01:25:17) Language Models Can Autonomously Hack and Self-Replicate
(01:28:48) How fast is autonomous AI cyber capability advancing?
(01:31:32) Positive Alignment: Artificial Intelligence for Human Flourishing

Synthetic Media & Art
(01:33:15) OpenAI is making it easier to check if an image was made by their models | TechCrunch
(01:33:56) How Chinese short dramas became AI content machines | MIT Technology Review
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 34min
May 18, 2026#245 - TML-Interaction, Claude For Legal, Sam Altman on Stand
Our 245th episode with a summary and discussion of last week's big AI news!
Recorded on 05/13/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
OpenAI released new voice intelligence API features including GPT Realtime 2 (GPT-5-powered) plus realtime translation and Whisper transcription, emphasizing the latency–reasoning tradeoff, larger context, and new guardrails amid fraud risks.
Thinking Machines previewed a low-latency, full‑duplex conversational system with a two-model architecture and custom inference stack, reporting strong interactivity benchmark results but without public access or third‑party validation yet.
Anthropic pushed further into vertical products with Claude for Legal and deeper AWS availability, while ongoing ecosystem tension grows as platform model providers compete with application-layer companies.
Safety, policy, and research updates included OpenAI’s self-harm trusted contact feature, Anthropic work on reducing agent misalignment by training ethical “why” reasoning, OpenAI’s investigation of accidental chain-of-thought grading in RL, and Meta horizon eval updates showing benchmarking limits for long task horizons.

Timestamps:
(00:00:10) Intro / Banter
(00:01:35) Response to listener comments
(00:03:27) Sponsor Break
Tools & Apps
(00:06:27) OpenAI launches new voice intelligence features in its API | TechCrunch
(00:15:52) Thinking Machines drops a new, highly responsive model designed for humanlike interactions in real time - SiliconANGLE
(00:27:49) Claude For Legal Launches, May Reshape the Legal Tech World – Artificial Lawyer
(00:40:27) Threads tests a Meta AI integration that works similarly to Grok | TechCrunch
(00:43:08) Google brings agentic AI and vibe-coded widgets to Android | TechCrunch
(00:45:33) Google updates AI search to include quotes from Reddit and other sources | TechCrunch
Applications & Business
(00:47:38) Sam Altman was winning on the stand, but it might not be enough | The Verge
(00:55:04) Nvidia C.E.O. Jensen Huang Hitches Ride With Trump to China After Last-Minute Invite - The New York Times
(00:58:40) AWS expands Anthropic partnership with Claude Platform launch
(01:01:13) Chinese grey market sells Claude API access at 90% off by using stolen credentials, model substitution, and harvesting users' prompts and outputs for resale as AI training data — 'transfer stations' operate through proxy networks that harvest user data
(01:06:43) DeepMind Spinout Isomorphic Labs Raises $2.1 Billion to Design Drugs With AI - Bloomberg
Projects & Open Source
(01:09:04) Petri: Anthropic Hands Its Alignment Toolbox to Meridian Labs with 3.0 Update
(01:12:25) Daybreak': OpenAI's Answer to Anthropic's Project Glasswing Has Arrived
Policy & Safety
(01:14:04) Teaching Claude why
(01:21:45) Import AI 455: Automating AI Research
(01:28:31) ChatGPT's New Safety Feature Could Alert 'Trusted Contact' to Risk of Self-Harm - CNET
(01:30:09) Investigating the consequences of accidentally grading CoT during RL
(01:34:46) Natural Language Autoencoders criticism
(01:39:15) Review of the "Risks from automated R&D" section in the Anthropic Risk Report (February 2026)
Synthetic Media & Art
(01:43:39) George Clooney, Tom Hanks, and Meryl Streep back new ‘Human Consent Standard’ for AI licensing | The Verge
Research & Advancements
(01:45:10) METR says Claude Mythos is testing the limits of AI evaluation – Startup Fortune
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 50min
May 11, 2026#244 - GPT-5.5 Instant, Grok 4.3, OpenAI vs Musk
Our 244th episode with a summary and discussion of last week's big AI news!
Recorded on 05/08/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
OpenAI released GPT-5.5 Instant as ChatGPT’s new default model, showing large benchmark gains and crossing a “high” cyber-risk threshold under its preparedness framework, while bio-safety results were mixed.
OpenAI investigated and patched ChatGPT’s “goblin” obsession, attributing it to reinforcement-learning rewards that over-amplified playful creature metaphors in a nerdy persona that later bled across versions.
Major industry moves included xAI’s Grok 4.3 price cuts and voice tools, Mistral’s unified Medium 3.5 model and Work mode, and Anthropic’s managed-agent upgrades alongside a surprise SpaceX compute deal and reports of a much higher Anthropic valuation.
Key policy and security developments covered the Musk–OpenAI trial details, Pentagon AI deployments on classified networks, expanded U.S. government pre-release model reviews, and reports of NSA testing Anthropic’s Mythos on Microsoft software.

Timestamps:
(00:00:10) Intro / Banter
(00:01:14) News Preview
(00:04:39) Response to listener comments

Tools & Apps
(00:13:40) OpenAI releases GPT-5.5 Instant, a new default model for ChatGPT | TechCrunch
(00:18:23) ChatGPT Became So Obsessed With Goblins That OpenAI Had to Intervene
(00:27:14) xAI launches Grok 4.3 at an aggressively low price and a new, fast, powerful voice cloning suite | VentureBeat
(00:33:49) Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model
(00:39:28) Anthropic updates Claude Managed Agents with three new features - 9to5Mac
(00:43:42) ElevenLabs Revamps AI Music Platform as Fan-Focused Service

Applications & Business
(00:44:57) A diary, a threat, and a $30 billion stake: What the Musk vs OpenAI trial has actually shown in its first week - The Times of India
(00:55:28) Anthropic, SpaceX Sign Deal to Boost AI Computing Power for Claude Software - Bloomberg
(01:01:48) Anthropic in talks with investors to raise funds at $900 billion valuation, higher than OpenAI
(01:02:37) Anthropic and OpenAI are both launching joint ventures for enterprise AI services | TechCrunch
(01:06:15) Anthropic and FIS Are Building an AI Agent to Help Banks Police Financial Crimes
(01:07:02) AMD’s revenue jumps 38 percent from last year as Q1 data center sales hit $5.8 billion. | The Verge
(01:08:51) Banks seek to offload risk to avoid ‘choking’ on data centre debt
(01:14:08) DeepSeek could be valued at up to $50 billion in first fundraising, sources say | Reuters

Projects & Open Source
(01:16:14) Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
(01:22:23) OpenAI just open-sourced its data center networking technology

Policy & Safety
(01:25:02) Pentagon inks deals with Nvidia, Microsoft, and AWS to deploy AI on classified networks | TechCrunch
(01:27:27) Google, Microsoft, and xAI will allow the US government to review their new AI models | The Verge
(01:32:11) NSA Testing Anthropic’s Mythos to Find Flaws in Microsoft Tech
(01:35:42) Introspection Adapters: Training LLMs to Report Their Learned Behaviors

Research & Advancements
(01:41:18) Recursive Multi-Agent Systems
(01:51:47) Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 56min
May 03, 2026#243 - GPT 5.5, DeepSeek V4, AI safety sabotage
Our 243rd episode with a summary and discussion of last week's big AI news!
Recorded on 04/29/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
OpenAI released GPT-5.5 with strong coding-oriented improvements, a system card discussing chain-of-thought monitorability and misalignment testing, higher pricing than GPT-5.4, and notable quirks like a system-prompt warning about “goblins.”
xAI launched Grok Voice Think Fast 1.0, claiming large benchmark leads for real-time voice agents and reporting major Starlink customer-support automation and sales conversion impact.
DeepSeek open-sourced DeepSeek V4 (Pro and Flash) featuring MoE scaling and 1M-token context via hybrid/compressed attention changes, while Tencent released Hunyuan 3 preview with weaker benchmark performance; a new long-horizon agent benchmark (Clawmark) shows low task success rates.
Major business, legal, and policy updates include Google’s planned up-to-$40B investment and 5GW compute commitment to Anthropic, Meta’s AWS Gravitron deal and China blocking Meta’s Manus acquisition, a revamped OpenAI–Microsoft agreement, ongoing Musk–OpenAI trial developments, and new safety/security research on sabotage, document degradation under delegation, and bit-flip attacks.

Timestamps:
(00:00:10) Intro / Banter
(00:02:00) News Preview
(00:02:26) Response to listener comments
(00:02:55) Sponsors

Tools & Apps
(00:05:55) OpenAI Unveils Its New, More Powerful GPT-5.5 Model - The New York Times
(00:23:33) xAI Launches grok-voice-think-fast-1.0: Topping τ-voice Bench at 67.3%, Outperforming Gemini, GPT Realtime, and More - MarkTechPost
(00:29:00) Claude can now plug directly into Photoshop, Blender, and Ableton | The Verge

Projects & Open Source
(00:29:38) China's DeepSeek releases preview of long-awaited V4 model as AI race intensifies
(00:47:05) Tencent Unveils Hy3 preview; Model Enhances Agent Capabilities and Real-World Usability - Tencent 腾讯
(00:50:14) ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Applications & Business
(00:53:03) Google Plans to Invest Up to $40 Billion in Anthropic
(00:56:26) Meta will use hundreds of thousands of AWS Graviton chips
(00:59:51) China blocks Meta's $2 billion takeover of AI startup Manus
(01:01:45) OpenAI shakes up partnership with Microsoft, capping revenue share payments
(01:07:13) Elon Musk Testifies of AI Risk at Trial, Says OpenAI Tried to ‘Steal’ a Charity - WSJ
(01:11:50) Judge rejects DOJ bid to delay Anthropic appeal in Pentagon dispute
(01:14:42) Google’s Gemini can now run on a single air-gapped server — and vanish when you pull the plug
(01:19:07) DeepMind's David Silver just raised $1.1B to build an AI that learns without human data | TechCrunch

Policy & Safety
(01:22:47) Evaluating whether AI models would sabotage AI safety research
(01:28:59) LLMs Corrupt Your Documents When You Delegate
(01:32:50) Temporal Sparse Autoencoders: Leveraging the Sequential Nature of Language for Interpretability
(01:39:53) Memorandum on Adversarial Distillation of American AI Models
(01:41:41) Teen boys are dating their AI chatbots—and experts warn it could kill their careers | Fortune
(01:43:57) Announcing the Anthropic Economic Index Survey
(01:45:21) Scoop: CISA lacks access to Anthropic's Mythos

Synthetic Media & Art
(01:48:03) Taylor Swift Files to Trademark Voice and Likeness to Protect Against AI Misuse

Research & Advancements
(01:49:15) Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 53min
April 29, 2026#242 - ChatGPT Images 2.0, Qwen 3.6 Max, Kimi-K2.6
Our 242nd episode with a summary and discussion of last week's big AI news!
Recorded on 04/22/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at [email protected] and/or [email protected]
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
OpenAI released a new ChatGPT image model that excels at accurate text and screenshot-like generations, suggesting a transformer-style approach aligned with agentic “computer use” ambitions.
Chinese model activity accelerated with Alibaba’s Qwen 3.6 Max Preview moving to an API-only offering, plus open releases from Moonshot AI (Kimi K2.6, a 1T-parameter MoE) and Minimax (Minimax M 2.7) showing strong benchmark results.
Google expanded Deep Research with a “Max” option built on Gemini 3.1 Pro and MCP support for accessing proprietary data, while Mozilla reported using Anthropic’s Claude to find and fix 271 Firefox bugs.
Business and policy updates include a reported SpaceX–Cursor deal with a $60B buy option, Cerebras filing for an IPO, Amazon adding $5B to Anthropic alongside a $100B AWS spending pledge, and platform responses to synthetic media like AI music spam and YouTube deepfake takedown requests.

Timestamps:
(00:00:10) Intro / Banter
(00:01:05) News Preview
(00:01:41) Sponsors
(00:04:41) Response to listener comments

Tools & Apps
(00:09:40) ChatGPT's new Images 2.0 model is surprisingly good at generating text | TechCrunch
(00:16:02) Alibaba Drops Qwen 3.6 Max Preview—Its Most Powerful Model Yet - Decrypt
(00:19:26) Google launches Deep Research and Deep Research Max agents to automate complex research
(00:25:00) Mozilla Used Anthropic’s Mythos to Find and Fix 271 Bugs in Firefox | WIRED
(00:28:35) Ordering with the Starbucks ChatGPT app was a true coffee nightmare | The Verge

Applications & Business
(00:29:48) SpaceX is working with Cursor and has an option to buy the startup for $60B | TechCrunch
(00:34:11) AI chip startup Cerebras files for IPO | TechCrunch
(00:38:23) Two startups want to replace how AI learns: one just raised $180M, another is seeking up to $1B
(00:38:56) Months-old start-up Recursive Superintelligence raises $500mn for self-teaching AI
(00:41:36) Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return | TechCrunch
(00:45:09) Kevin Weil and Bill Peebles exit OpenAI as company continues to shed 'side quests' | TechCrunch
(00:46:04) Meta hires five Thinking Machines Lab founders including a reported $1.5 billion engineer - Meta cuts 198 Bay Area jobs as even larger layoffs reportedly loom
(00:50:12) Meta employees are up in arms over a mandatory program to train AI on their mouse movements and keystrokes
(00:51:43) Chinese fabs import record volumes of US chipmaking equipment via Singapore and Malaysia — homegrown tool makers booked record 2025 revenues as price competition squeezes margins
(00:54:01) Google Eyes New Chips to Speed Up AI Results, Challenging Nvidia
(00:54:20) Canadian quantum company Xanadu soars to $16 billion valuation after Nvidia release

Projects & Open Source
(01:00:13) Moonshot AI releases Kimi-K2.6 model with 1T parameters, attention optimizations - SiliconANGLE
(01:05:22) MiniMax Just Open Sourced MiniMax M2.7: A Self-Evolving Agent Model that Scores 56.22% on SWE-Pro and 57.0% on Terminal Bench 2 - MarkTechPost

Policy & Safety
(01:06:25) Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
(01:10:25) Scoop: NSA using Anthropic's Mythos despite blacklist
(01:11:03) Unauthorized group has gained access to Anthropic’s exclusive cyber tool Mythos, report claims

Research & Advancements
(01:17:21) Parcae: Scaling Laws For Stable Looped Language Models
(01:24:20) OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language Environment Simulation

Synthetic Media & Art
(01:27:01) Deezer says 44% of songs uploaded to its platform daily are AI-generated | TechCrunch
(01:29:47) Celebrities will be able to find and request removal of AI deepfakes on YouTube | The Verge
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
...more
1h 31min

FAQs about Last Week in AI:

How many episodes does Last Week in AI have?

The podcast currently has 293 episodes available.

More shows like Last Week in AI

The a16z Show by Andreessen Horowitz

The a16z Show

1,093 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

345 Listeners

Practical AI by Daniel Whitenack and Chris Benson

Practical AI

208 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

99 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

576 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

508 Listeners

The Artificial Intelligence Show by Paul Roetzer and Mike Kaput

The Artificial Intelligence Show

212 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

143 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

101 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

226 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

682 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

111 Listeners

A Beginner's Guide to AI by Dietmar Fischer

A Beginner's Guide to AI

56 Listeners

The Next Wave - AI and The Future of Technology by Mindstream (Hubspot Media)

The Next Wave - AI and The Future of Technology

54 Listeners