HuggingFace 每日AI论文速递

2025.12.19 | Kling-Omni一统视频生成;LLaDA2.0百亿扩散模型


Listen Later

本期的 14 篇论文如下:

[00:26] 🎬 Kling-Omni Technical Report(Kling-Omni技术报告)

[01:02] 🚀 LLaDA2.0: Scaling Up Diffusion Language Models to 100B(LLaDA2.0:将扩散语言模型扩展至1000亿参数)

[01:41] 🔮 Next-Embedding Prediction Makes Strong Vision Learners(下一嵌入预测构建强大的视觉学习器)

[02:27] 👓 StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors(StereoPilot:通过生成先验学习统一且高效的立体转换)

[02:58] 🎬 Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model(Seedance 1.5 pro:一个原生音视频联合生成基础模型)

[03:34] 🔭 Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation(全景深度估计基础模型:深度任意全景)

[04:11] 📸 Generative Refocusing: Flexible Defocus Control from a Single Image(生成式重聚焦:从单张图像实现灵活散焦控制)

[04:56] 🤖 Adaptation of Agentic AI(智能体人工智能的适应性研究)

[05:36] ⚗ Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection(炼金术士:通过元梯度数据选择提升文本到图像模型训练效率)

[06:12] 🛡 DeContext as Defense: Safe Image Editing in Diffusion Transformers(以去上下文为防御:扩散变换器中的安全图像编辑)

[06:58] 🧭 N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models(N3D-VLM:原生3D基础实现视觉语言模型中的精确空间推理)

[07:49] 🎨 The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text(世界即画布:用参考图像、轨迹和文本绘制可提示事件)

[08:30] 🔧 AdaTooler-V: Adaptive Tool-Use for Images and Videos(AdaTooler-V:面向图像与视频的自适应工具使用)

[09:19] 🤔 Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward(探索与利用之辩:通过裁剪、熵与虚假奖励重新审视RLVR)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

291 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

295 Listeners

思文,败类 by 思文败类

思文,败类

156 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

135 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners