Embodied AI 101

CaP-X: Coding Agents for Physical eXecution


Listen Later

CaP-X is an open-source agentic robotics framework where LLMs/VLMs generate code to call perception and control APIs for execution across diverse simulated and real robots in CaP-Gym's 187 manipulation tasks. The framework includes CaP-Bench for evaluating frontier models and CaP-RL, which boosts a 7B model's success from 20% to 72% with minimal sim-to-real gap.
...more
View all episodesView all episodes
Download on the App Store

Embodied AI 101By Shaoqing Tan