Models & Agents

Ep 21: New arXiv papers expose critical flaws in how we evaluate depression-detection models, LLM pruning, and verbalized confidence.


Listen Later

**Models & Agents**
**Date:** March 27, 2026
**HOOK:** New arXiv papers expose critical flaws in how we evaluate depression-detection models, LLM pruning, and verbalized confidence.
**What You Need to Know:** Today's cs.CL batch reveals that many impressive medical AI results may be artifacts of interviewer prompts rather than genuine participant signals, while pruning works for classification but breaks generation due to probability-space amplification. ...
AI Disclosure: This podcast is curated by Patrick but uses AI-generated voice synthesis (ElevenLabs) for audio production.
...more
View all episodesView all episodes
Download on the App Store

Models & AgentsBy Patrick