Embodied AI 101

CaP-X: LMs' First Physical Exam


Listen Later

A novel benchmark that evaluates language models on physical examination tasks, testing their ability to understand and perform clinical physical exam procedures in simulated environments. This work introduces a comprehensive evaluation framework for AI systems in medical/clinical settings.
...more
View all episodesView all episodes
Download on the App Store

Embodied AI 101By Shaoqing Tan