## Episode Summary
In this episode, we cover:
- **HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2604.09408)
- **Counting as a minimal probe of language model reliability** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2605.02028)
- **Linear-Time Global Visual Modeling without Explicit Attention** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2605.01711)
- **Assessing Pancreatic Ductal Adenocarcinoma Vascular Invasion: the PDACVI Benchmark** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2604.27582)
- **Prior-Aligned Data Cleaning for Tabular Foundation Models** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2604.25154)
---
*Sponsored by LimitLess AI*