April 02, 2026

CaP-X: LMs' First Physical Exam

22 minutes

A novel benchmark that evaluates language models on physical examination tasks, testing their ability to understand and perform clinical physical exam procedures in simulated environments. This work introduces a comprehensive evaluation framework for AI systems in medical/clinical settings.

...more

View all episodes

By Shaoqing Tan

April 02, 2026

CaP-X: LMs' First Physical Exam

22 minutes

...more

Share CaP-X: LMs' First Physical Exam

Sign up to save your podcasts

CaP-X: LMs' First Physical Exam

CaP-X: LMs' First Physical Exam