AI is getting uncomfortably human. This week on Byte Of Truth, we cut through the hype to explore the industry's biggest contradictions. From World and Zoom using crypto-orbs to verify humanity, to LLM judges that secretly go easy on models when the stakes are high.
Segments:
- Prove You’re Human: The irony of identity verification in the age of AI.
- Industry Shakeups: OpenAI shrinks while Anthropic swells. Who has the right strategy?
- AI Safety: LLMs that lie, fake evaluations, and consistently defect in social dilemmas.
- Research Roundup: The Muon optimizer, looped transformers, and the limits of reasoning.
- AI in the Wild: Escaping the robot cooking graveyard, synthetic neurons, and vibe-coding for hardware.
- Culture Clash: The Tokenmaxxing trap, AI journalism, and the Netflix-ification of everything.
Tune in for a thoughtful, witty, and occasionally provocative deep dive into the week's most pressing AI stories.
Show notes
Articles & Papers Discussed:
- Prove You're Human: "Zoom teams up with World to verify humans in meetings" (TechCrunch), "World verification expands to Tinder" (TechCrunch / Wired), "This Beanie Is Designed to Read Your Thoughts" (Wired)
- Corporate Moves: "Kevin Weil and Bill Peebles exit OpenAI" (TechCrunch), "Anthropic Plots Major London Expansion" (Wired), "Anthropic launches Claude Design" (TechCrunch), "Cursor in talks to raise over two billion dollars at a fifty billion dollar valuation" (TechCrunch), "UK Launches six hundred seventy-five million dollar Sovereign AI Fund" (Wired)
- AI Safety: "Context Over Content: Evaluation Faking in LLM Judges" (arXiv), "CoopEval: LLM Agents in Social Dilemmas" (arXiv), "Agentic Microphysics: Manifesto for Generative AI Safety" (arXiv), "Critical-CoT: Defense Against Reasoning-Level Backdoor Attacks" (arXiv)
- Research Breakthroughs: "Benchmarking Optimizers for MLPs (Muon > AdamW)" (arXiv), "Stability and Generalization in Looped Transformers" (arXiv), "Generalization in LLM Problem Solving" (arXiv), "LLMs and VLMs Understanding Viewpoint Rotation" (arXiv), "Prism: Symbolic Superoptimization of Tensor Programs" (arXiv), "TokenGS: three-dimensional Gaussian Prediction with Learnable Tokens" (arXiv)
- AI in the Wild: "Chef Robotics escaped the robot cooking graveyard" (TechCrunch), "RadAgent: Tool-using AI agent for chest CT interpretation" (arXiv), "AI-generated synthetic neurons speed up brain mapping" (Google Research), "Schematik Is Cursor for Hardware" (Wired), "Robot swarms: Adding randomness prevents gridlock" (ScienceDaily)
- Culture Clash: "Tokenmaxxing is making developers less productive" (TechCrunch), "AI Drafting My Stories? Over My Dead Body" (Wired), "Netflix plans vertical video feed, AI recommendations" (TechCrunch), "MIT SHASS and the future of education in the age of AI" (MIT News)