The 15 papers in this episode:
[00:24] 💡 Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
[01:02] 🤖 System Prompt Optimization with Meta-Learning
[01:47] 🤖 EnerVerse-AC: Envisioning Embodied Environments with Action Condition
[02:29] 🧠 The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think
[03:17] 🤖 EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
[03:57] 🖼 End-to-End Vision Tokenizer Tuning
[04:34] 📈 WorldPM: Scaling Human Preference Modeling
[05:13] 🤖 MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering
[06:01] 🧩 Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning
[06:43] 🎨 Style Customization of Text-to-Vector Generation with Image Diffusion Priors
[07:25] 🧠 J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
[08:07] 👉 PointArena: Probing Multimodal Grounding Through Language-Guided Pointing
[08:47] 🖼 Depth Anything with Any Prior
[09:29] 🖼 OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
[10:14] 🚀 Parallel Scaling Law for Language Models
[Follow Us]
You can also find us on the following platform for more beyond the podcast:
Xiaohongshu: AI速递