HuggingFace 每日AI论文速递

2025.09.09 | REER提升推理性能;WebExplorer训练智能体


Listen Later

本期的 15 篇论文如下:

[00:21] 💡 Reverse-Engineered Reasoning for Open-Ended Generation(面向开放式生成的逆向工程推理)

[00:47] 🌐 WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents(WebExplorer:探索与演进,用于训练长周期网络智能体)

[01:17] 🚀 Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models(革新扩散大语言模型的强化学习框架)

[01:38] 🤔 Does DINOv3 Set a New Medical Vision Standard?(DINOv3 能否树立医学视觉新标准?)

[02:06] 🛠 Reinforced Visual Perception with Tools(基于工具的强化视觉感知)

[02:26] 🤖 Reinforcement Learning Foundations for Deep Research Systems: A Survey(深度研究系统中的强化学习基础:综述)

[02:55] 👁 Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning(通过对比注意力聚焦:增强VLM的视觉推理能力)

[03:28] 🎥 UniVerse-1: Unified Audio-Video Generation via Stitching of Experts(UniVerse-1:通过专家模型拼接实现统一音视频生成)

[03:50] 🤔 Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?(绘画易于思考:文生图模型能布景,但无法主导剧情吗?)

[04:12] 🤔 Interleaving Reasoning for Better Text-to-Image Generation(通过交错推理提升文本到图像生成)

[04:37] 🤖 Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents(Paper2Agent:将研究论文重构为交互式可靠的AI代理)

[05:05] ⚙ Guided Decoding and Its Critical Role in Retrieval-Augmented Generation(引导式解码及其在检索增强生成中的关键作用)

[05:36] 🚀 Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers(扩展用于大型语言模型分步证明器的多轮离策略强化学习和多智能体树搜索)

[06:04] 🛡 \texttt{R$^\textbf{2}$AI}: Towards Resistant and Resilient AI in an Evolving World(R$^2$AI:迈向演进世界中的抵抗性与韧性AI)

[06:30] 🌍 Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian(Llama-GENBA-10B:一个德语、英语和巴伐利亚语三语大型语言模型)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文,败类 by 思文败类

思文,败类

157 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners