MolmoBot: A Vision-Language Model for Zero-Shot Robot Manipulation
Vision-language model (VLM) for zero-shot robot manipulation, trained entirely in simulation without real-world data; achieves 79.2% success rate on real-world tabletop tasks, outperforming π₀.₅ baseline at 39.2%.