Embodied AI 101

CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models


Listen Later

Closed-loop framework coupling Vision-Language Models with Video Generation Models at step-level granularity. Mitigates long-horizon drift and mid-clip errors in goal-directed video reasoning for robotic planning.
...more
View all episodesView all episodes
Download on the App Store

Embodied AI 101By Shaoqing Tan