January 24, 2025

2025.01.24 | SRMT提升多智能体协作能力，VideoReward优化视频生成质量。

Listen Later

10 minutes

本期的 15 篇论文如下：

[00:26] 🧠 SRMT: Shared Memory for Multi-agent Lifelong Pathfinding（SRMT：多智能体终身路径规划中的共享记忆）

[01:05] 🎥 Improving Video Generation with Human Feedback（利用人类反馈改进视频生成）

[01:40] ⚡ Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models（Sigma：查询、键和值的差分重缩放以实现高效语言模型）

[02:20] 🖼 Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step（能否通过思维链生成图像？逐步验证和强化图像生成）

[02:55] 🖼 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models（IMAGINE-E：最先进文本到图像模型的图像生成智能评估）

[03:32] 📚 Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos（Video-MMMU：评估从多学科专业视频中获取知识的能力）

[04:14] 🎥 DiffuEraser: A Diffusion Model for Video Inpainting（DiffuEraser：基于扩散模型的视频修复）

[04:50] 🎥 Temporal Preference Optimization for Long-Form Video Understanding（长视频理解中的时序偏好优化）

[05:29] 🎨 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt（一提示一故事：使用单一提示实现免费午餐式一致的文本到图像生成）

[06:07] 🎥 EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion（EchoVideo：基于多模态特征融合的身份保持人类视频生成）

[06:42] 🧠 Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback（Step-KTO：通过逐步二元反馈优化数学推理）

[07:17] 🧠 Debate Helps Weak-to-Strong Generalization（辩论助力弱到强泛化）

[07:53] 🤔 Evolution and The Knightian Blindspot of Machine Learning（进化与机器学习的奈特盲点）

[08:30] 🧪 Hallucinations Can Improve Large Language Models in Drug Discovery（幻觉可以提升大语言模型在药物发现中的表现）

[09:10] 🌀 GSTAR: Gaussian Surface Tracking and Reconstruction（GSTAR：高斯曲面跟踪与重建）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

HuggingFace 每日AI论文速递

By duan

5

22 ratings

January 24, 2025

2025.01.24 | SRMT提升多智能体协作能力，VideoReward优化视频生成质量。

Listen Later

10 minutes

本期的 15 篇论文如下：

[00:26] 🧠 SRMT: Shared Memory for Multi-agent Lifelong Pathfinding（SRMT：多智能体终身路径规划中的共享记忆）

[01:05] 🎥 Improving Video Generation with Human Feedback（利用人类反馈改进视频生成）

[01:40] ⚡ Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models（Sigma：查询、键和值的差分重缩放以实现高效语言模型）

[02:20] 🖼 Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step（能否通过思维链生成图像？逐步验证和强化图像生成）

[02:55] 🖼 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models（IMAGINE-E：最先进文本到图像模型的图像生成智能评估）

[03:32] 📚 Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos（Video-MMMU：评估从多学科专业视频中获取知识的能力）

[04:14] 🎥 DiffuEraser: A Diffusion Model for Video Inpainting（DiffuEraser：基于扩散模型的视频修复）

[04:50] 🎥 Temporal Preference Optimization for Long-Form Video Understanding（长视频理解中的时序偏好优化）

[05:29] 🎨 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt（一提示一故事：使用单一提示实现免费午餐式一致的文本到图像生成）

[06:07] 🎥 EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion（EchoVideo：基于多模态特征融合的身份保持人类视频生成）

[06:42] 🧠 Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback（Step-KTO：通过逐步二元反馈优化数学推理）

[07:17] 🧠 Debate Helps Weak-to-Strong Generalization（辩论助力弱到强泛化）

[07:53] 🤔 Evolution and The Knightian Blindspot of Machine Learning（进化与机器学习的奈特盲点）

[08:30] 🧪 Hallucinations Can Improve Large Language Models in Drug Discovery（幻觉可以提升大语言模型在药物发现中的表现）

[09:10] 🌀 GSTAR: Gaussian Surface Tracking and Reconstruction（GSTAR：高斯曲面跟踪与重建）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

More shows like HuggingFace 每日AI论文速递

硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文，败类 by 思文败类

思文，败类

157 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners