
Sign up to save your podcasts
Or


Why orchestrate multiple AI agents when a single strong model is so capable? Jon walks through three distinct rationales — capability, parallel context, and trust — and uses Anthropic's Claude Mythos Preview and Project Glasswing as the live, industrial-scale case study.
Credits
Cover Art by Brianna Williams
TMOM Intro Music by Danny Meza
A special thank you to these talented artists for their contributions to the show.
Links and Reference
Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report
Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7
Project Glasswing announcement: https://www.anthropic.com/glasswing
Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/
Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report
Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench
Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692
Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)
AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942
Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system
MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620
MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)
Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)
Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/
Abandoned Episode Titles
How to Build God and Then Email Yourself About It from the Park
Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta
Mythos Cleaned Its Git History So You Wouldn't Have To
OpenBSD Spent 27 Years Hardening the Wrong Things
By John Jezl and Jon RochaWhy orchestrate multiple AI agents when a single strong model is so capable? Jon walks through three distinct rationales — capability, parallel context, and trust — and uses Anthropic's Claude Mythos Preview and Project Glasswing as the live, industrial-scale case study.
Credits
Cover Art by Brianna Williams
TMOM Intro Music by Danny Meza
A special thank you to these talented artists for their contributions to the show.
Links and Reference
Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report
Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7
Project Glasswing announcement: https://www.anthropic.com/glasswing
Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/
Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report
Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench
Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692
Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)
AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942
Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system
MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620
MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)
Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)
Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/
Abandoned Episode Titles
How to Build God and Then Email Yourself About It from the Park
Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta
Mythos Cleaned Its Git History So You Wouldn't Have To
OpenBSD Spent 27 Years Hardening the Wrong Things