【本日の論文】
1. Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
https://huggingface.co/papers/2605.28816
2. ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation
https://huggingface.co/papers/2605.28293
3. Agent Explorative Policy Optimization for Multimodal Agentic Reasoning
https://huggingface.co/papers/2605.28774
4. From Pixels to Words -- Towards Native One-Vision Models at Scale
https://huggingface.co/papers/2605.28820
5. Self-Improving Language Models with Bidirectional Evolutionary Search
https://huggingface.co/papers/2605.28814