[Today's Papers]
1. From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
https://huggingface.co/papers/2604.22446
2. World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
https://huggingface.co/papers/2604.24764
3. ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning
https://huggingface.co/papers/2604.24300
4. Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
https://huggingface.co/papers/2604.24763
5. Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms
https://huggingface.co/papers/2604.23775