【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:31] 🤖 Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey(基于大语言模型的软件工程问题解决:进展、前沿与全面综述)
[01:15] 🔮 FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs(FutureOmni:评估多模态大语言模型基于全模态上下文进行未来预测的能力)
[02:11] ⚡ Toward Efficient Agents: Memory, Tool learning, and Planning(迈向高效智能体:记忆、工具学习与规划)
[02:51] 🤖 Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization(Being-H0.5:基于人类中心机器人学习的跨具身泛化扩展)
[03:40] 🎬 OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer(OmniTransfer:时空视频迁移的一体化框架)
[04:28] 🧠 $\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models(《MemoryRewardBench:面向大语言模型长期记忆管理的奖励模型基准评测》)
[05:15] 🧠 Think3D: Thinking with Space for Spatial Reasoning(Think3D:利用空间进行空间推理的思考)
[06:06] 🫁 UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation(UniX:统一自回归与扩散模型用于胸部X光片理解与生成)
[07:08] ⚙ ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents(ToolPRMBench:评估和推进工具使用智能体的过程奖励模型)
[07:58] 🧠 Aligning Agentic World Models via Knowledgeable Experience Learning(通过知识化经验学习对齐具身世界模型)
[08:45] 🤖 Agentic-R: Learning to Retrieve for Agentic Search(Agentic-R:面向智能体搜索的检索学习)
[09:25] 🔤 LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR(LightOnOCR:一个用于最先进OCR的10亿参数端到端多语言视觉语言模型)
[10:14] 📊 PRiSM: Benchmarking Phone Realization in Speech Models(PRiSM:语音模型中音素实现的基准测试)
[11:02] 🔍 On the Evidentiary Limits of Membership Inference for Copyright Auditing(论成员推理在版权审计中的证据性局限)
[11:46] 🔒 Fundamental Limitations of Favorable Privacy-Utility Guarantees for DP-SGD(差分隐私随机梯度下降(DP-SGD)中有利隐私-效用保证的基本局限性)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递