HuggingFace 每日AI论文速递

2025.08.04 | 扩散语言模型变长去噪,高效省资源;PixNerd图像扩散,高效高质量。


Listen Later

本期的 11 篇论文如下:

[00:22] 🔄 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models(超越固定长度:扩散大语言模型的可变长度去噪)

[00:44] 🎨 PixNerd: Pixel Neural Field Diffusion(PixNerd:像素神经场扩散)

[01:11] 💡 SWE-Exp: Experience-Driven Software Issue Resolution(SWE-Exp:经验驱动的软件问题解决)

[01:38] 🔍 Multimodal Referring Segmentation: A Survey(多模态指代表达分割:一项综述)

[01:59] 🧠 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding(3D-R1:增强3D VLM的推理能力以实现统一场景理解)

[02:40] 🤖 SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution(SWE-Debate:用于软件问题解决的竞争性多智能体辩论)

[03:05] ⚖ Learning an Efficient Multi-Turn Dialogue Evaluator from Multiple Judges(从多个评委中学习高效的多轮对话评估器)

[03:33] 🤯 Investigating Hallucination in Conversations for Low Resource Languages(研究低资源语言对话中的幻觉现象)

[04:00] 🧭 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation(IGL-Nav:用于图像目标导航的增量式三维高斯定位)

[04:30] 🎧 SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation(SpA2V: 利用空间听觉线索进行音频驱动的空间感知视频生成)

[04:55] 🎮 Multi-Agent Game Generation and Evaluation via Audio-Visual Recordings(多智能体游戏生成与评估基于视听记录)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文,败类 by 思文败类

思文,败类

156 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners