Embodied AI 101

MolmoAct2-LIBERO: An Open Vision-Language-Action Model for Robotics


Listen Later

Vision-Language-Action (VLA) model fine-tuned on the merged LIBERO robotics dataset (1,693 episodes, 273k+ frames) achieving 98.25% success rate on manipulation tasks. Released with both checkpoint and dataset for VLA finetuning.
...more
View all episodesView all episodes
Download on the App Store

Embodied AI 101By Shaoqing Tan