HuggingFace 每日AI论文速递

2025.04.03 | MergeVQ高效生成高质量图像,类R1-Zero提升视觉空间推理。


Listen Later

本期的 15 篇论文如下:

[00:23] 🎨 MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization(MergeVQ:一种用于视觉生成和表示的统一框架,具有解耦的Token合并和量化)

[01:00] 🧠 Improved Visual-Spatial Reasoning via R1-Zero-Like Training(通过类R1-Zero训练改进视觉空间推理)

[01:45] 🎮 AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction(动漫玩家:基于下一代游戏状态预测的无限动漫人生模拟)

[02:25] 🎬 VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step(VideoScene:提炼视频扩散模型以一步生成3D场景)

[03:03] 🎭 DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance(DreamActor-M1:基于混合引导的整体、富有表现力且鲁棒的人体图像动画)

[03:42] 🧐 Understanding R1-Zero-Like Training: A Critical Perspective(理解类R1-Zero训练:一个批判性的视角)

[04:28] 🎬 Towards Physically Plausible Video Generation via VLM Planning(基于视觉语言模型规划的物理合理视频生成)

[05:09] 🤖 PaperBench: Evaluating AI's Ability to Replicate AI Research(PaperBench:评估人工智能复现人工智能研究的能力)

[05:49] 🤖 ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations(ScholarCopilot:训练用于学术写作并提供精确引用的**大型语言模型**)

[06:31] 💡 ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement(ILLUME+:通过双重视觉Token化和扩散细化照亮统一的多模态大语言模型)

[07:11] 💃 Articulated Kinematics Distillation from Video Diffusion Models(基于视频扩散模型的铰接运动学提炼)

[07:51] 🛡 Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks(保障视觉-语言模型安全:缓解基于扰动攻击中高斯噪声的脆弱性)

[08:32] 👁 DASH: Detection and Assessment of Systematic Hallucinations of VLMs(DASH:视觉语言模型系统性幻觉的检测与评估)

[09:11] 🖼 Boost Your Human Image Generation Model via Direct Preference Optimization(通过直接偏好优化提升人体图像生成模型)

[09:47] 👁 LSNet: See Large, Focus Small(LSNet:观其大,聚焦小)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文,败类 by 思文败类

思文,败类

156 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners