Microsoft just killed Claude Code inside the company, and Nvidia, Meta, and Uber are all rethinking how much they let employees burn on AI coding. The "token maxing" era is ending fast.
In this episode of DeepMoat we break down what that means for how teams should actually use AI in 2026: Microsoft's pivot to GitHub Copilot, why Uber torched a full year of API budget in three months, and the real reason behind the "use 50% of your salary on AI" memos.
We also get into Opus 4.7 second impressions, where it finally clicks (design work, presentations, day-to-day vs GPT-5.5), and why Anthropic's stale training data still bites in coding work.
Then we go deep on AI agents: what they're actually useful for today, how to use the Buy Back Your Time framework to pick which tasks to automate, why evals matter before you trust an agent's output, and why low-stakes repetitive work is still the sweet spot until hallucination rates drop further.
If you're a founder, engineer, or operator trying to figure out where AI coding tools and AI agents actually drive ROI versus where they burn money, this one's for you.
Chapters
0:00 Intro
0:24 Microsoft kills Claude Code
1:27 Nvidia and Meta's token-maxing
4:08 Why Microsoft pushed Copilot
4:42 Opus 4.7 second impressions
7:43 Anthropic's stale training data
8:50 Why co-work still frustrates
10:00 Opus 4.7 vs GPT-5.5 for slides
13:30 AI agents: hype vs reality
15:35 Picking the right tasks to automate
18:56 Low-stakes is the sweet spot
#ClaudeCode #AIAgents #AICoding