Sign up to save your podcastsEmail addressPasswordRegisterOrContinue with GoogleAlready have an account? Log in here.
FAQs about Embodied AI 101:How many episodes does Embodied AI 101 have?The podcast currently has 89 episodes available.
May 19, 2026CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation ModelsClosed-loop framework coupling Vision-Language Models with Video Generation Models at step-level granularity. Mitigates long-horizon drift and mid-clip errors in goal-directed video reasoning for robotic planning....more43minPlay
May 19, 2026World Action Models: The Next Frontier in Embodied AIFirst systematic survey defining World Action Models (WAMs) as embodied foundation models that jointly predict future states and generate actions. Covers architectures, data ecosystems, and evaluation protocols....more37minPlay
May 18, 2026Training a Whole-Body Control Foundation ModelDescribes end-to-end learning of a foundation model for adaptive whole-body humanoid control via massive simulation variation. Combines proprioceptive perception and policy adaptation across embodiments....more40minPlay
May 18, 2026DexJoCo: A Unified Benchmark for Task-Oriented Dexterous ManipulationReleases an open-source MuJoCo-based benchmark with 11 dexterous tasks, low-cost teleoperation hardware, and 1.1K human demonstrations. Designed to evaluate and train modern VLA/robotic policies....more44minPlay
May 18, 2026MMSkills: Building Multimodal Skill Libraries for Visual AgentsSkill library, demonstrations, and dataset for multi-modal robotic skill learning and manipulation tasks....more20minPlay
May 18, 2026PhysBrain 1.0 VLA (TwinBrainVLA): Dual-Brain Vision-Language-Action with Physics-Grounded LearningIntroduces dual-brain fusion Vision-Language-Action model with LangForce physics-grounded training methodology....more26minPlay
May 17, 2026MolmoAct2-LIBERO: An Open Vision-Language-Action Model for RoboticsVision-Language-Action (VLA) model fine-tuned on the merged LIBERO robotics dataset (1,693 episodes, 273k+ frames) achieving 98.25% success rate on manipulation tasks. Released with both checkpoint and dataset for VLA finetuning....more39minPlay
May 17, 2026SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Diffusion TransformersA 2.6B-parameter open-source world model that generates coherent 720p, minute-long videos with precise 6-DoF camera control on a single GPU using a Hybrid Linear Diffusion Transformer + Gated DeltaNet for long-context efficiency. Targets controllable physics simulation....more21minPlay
May 17, 2026WildClawBench: A Real-World, Long-Horizon Benchmark for AI AgentsNew benchmark and dataset for robotic manipulation in unconstrained 'wild' environments. Includes standardized containers, leaderboards, and evaluation protocols for cross-embodiment policies....more33minPlay
May 17, 2026MCP-Cosmos: Bring Your Own World ModelIntroduces a latent-space world model framework that lets agents simulate state transitions and iteratively refine plans before real-world execution. Evaluated on 20+ MCP-Bench tasks with measurable gains in tool-use success....more25minPlay
FAQs about Embodied AI 101:How many episodes does Embodied AI 101 have?The podcast currently has 89 episodes available.