## Episode Summary
In this episode, we cover:
- **Forecasting Downstream Performance of LLMs With Proxy Metrics** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2605.18607)
- **DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback** (arXiv)
- [Read more](http://arxiv.org/abs/2605.22781v1)
- **Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2605.20244)
- **AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2605.17602)
- **Forecasting Scientific Progress with Artificial Intelligence** (Hugging Face Daily)
- [Read more](https://huggingface.co/papers/2605.22681)
---
*Sponsored by LimitLess AI*