May 05, 2026

Two Minds, Lower Trust

52 minutes

Why orchestrate multiple AI agents when a single strong model is so capable? Jon walks through three distinct rationales — capability, parallel context, and trust — and uses Anthropic's Claude Mythos Preview and Project Glasswing as the live, industrial-scale case study.

Credits

Cover Art by Brianna Williams

TMOM Intro Music by Danny Meza

A special thank you to these talented artists for their contributions to the show.

Links and Reference

Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report
Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7
Project Glasswing announcement: https://www.anthropic.com/glasswing
Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/
Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report
Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench
Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692
Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)
AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942
Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system
MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620
MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)
Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)
Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/

Abandoned Episode Titles

How to Build God and Then Email Yourself About It from the Park

Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta

Mythos Cleaned Its Git History So You Wouldn't Have To

OpenBSD Spent 27 Years Hardening the Wrong Things

...more

View all episodes

By John Jezl and Jon Rocha

May 05, 2026

Two Minds, Lower Trust

52 minutes

Credits

Cover Art by Brianna Williams

TMOM Intro Music by Danny Meza

A special thank you to these talented artists for their contributions to the show.

Links and Reference

Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report
Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7
Project Glasswing announcement: https://www.anthropic.com/glasswing
Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/
Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report
Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench
Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692
Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)
AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942
Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system
MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620
MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)
Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)
Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/

Abandoned Episode Titles

How to Build God and Then Email Yourself About It from the Park

Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta

Mythos Cleaned Its Git History So You Wouldn't Have To

OpenBSD Spent 27 Years Hardening the Wrong Things

...more

Share Two Minds, Lower Trust

Sign up to save your podcasts

Two Minds, Lower Trust

Two Minds, Lower Trust