HuggingFace 每日AI论文速递

2025.08.11 | GLM-4.5统一智能体推理编程;Voost高保真虚拟试穿试脱


Listen Later

本期的 11 篇论文如下:

[00:20] 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(GLM-4.5:智能体、推理与编程(ARC)基础模型)

[00:47] 👕 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off(Voost:一种统一且可扩展的双向虚拟试穿与试脱扩散Transformer)

[01:11] 🎯 InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization(InfiGUI-G1:通过自适应探索策略优化推进 GUI 元素定位能力)

[01:34] 🧠 Memp: Exploring Agent Procedural Memory(Memp:探索智能体程序性记忆)

[02:03] ✂ Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal(剪枝非关键信息:基于首令牌惊奇度的高效代码推理)

[02:29] 🪄 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing(GENIE:用于神经辐射场交互式编辑的高斯编码)

[02:50] 📚 Adapting Vision-Language Models Without Labels: A Comprehensive Survey(无标签视觉-语言模型适应:一项全面综述)

[03:15] 🌍 MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs(MELLA:弥合低资源语言多模态大语言模型的语言能力与文化扎根性)

[03:37] 🧱 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh(MeshLLM:赋能大型语言模型逐步理解和生成3D网格)

[04:02] 🎯 UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding(UI-AGILE:以有效强化学习和精准推断时定位提升图形用户界面智能体)

[04:30] ✨ LightSwitch: Multi-view Relighting with Material-guided Diffusion(光开关:基于材料引导扩散的多视角重照明)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文,败类 by 思文败类

思文,败类

156 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners