October 21, 2025

2025.10.21 | 模型不懂光影折射；小模型也能写报告

Listen Later

10 minutes

本期的 13 篇论文如下：

[00:21] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?（PICABench：我们离物理真实的图像编辑还有多远？）

[01:04] 🤖 DeepAnalyze: Agentic Large Language Models for Autonomous Data Science（DeepAnalyze：面向自主数据科学的智能体大模型）

[01:50] 🗜 Glyph: Scaling Context Windows via Visual-Text Compression（Glyph：通过视觉-文本压缩扩展上下文窗口长度）

[02:23] 🔍 Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation（面向通用检索增强生成的混合模态检索研究）

[03:10] 🔗 When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling（何时集成：定位Token级位置实现稳定高效的大模型集成）

[04:09] 🎯 Annotation-Efficient Universal Honesty Alignment（注释高效型通用诚实对齐）

[04:49] 🖌 Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback（Uniworld-V2：借助扩散负感知微调与MLLM隐式反馈强化图像编辑）

[05:46] 👁 RL makes MLLMs see better than SFT（强化学习让多模态大模型看得比监督微调更清楚）

[06:33] 🚀 Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling（视觉自回归模型在推理时扩展上击败扩散模型）

[07:09] 🎨 ConsistEdit: Highly Consistent and Precise Training-free Visual Editing（ConsistEdit：面向MM-DiT的高一致免训练视觉编辑）

[07:56] 🔄 Deep Self-Evolving Reasoning（深度自演化推理）

[08:22] 🧠 Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI（超越流水线：模型原生智能体AI范式转移综述）

[09:07] 🔮 Chronos-2: From Univariate to Universal Forecasting（Chronos-2：从单变量到通用预测）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

HuggingFace 每日AI论文速递

By duan

5

22 ratings

October 21, 2025

2025.10.21 | 模型不懂光影折射；小模型也能写报告

Listen Later

10 minutes

本期的 13 篇论文如下：

[00:21] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?（PICABench：我们离物理真实的图像编辑还有多远？）

[01:04] 🤖 DeepAnalyze: Agentic Large Language Models for Autonomous Data Science（DeepAnalyze：面向自主数据科学的智能体大模型）

[01:50] 🗜 Glyph: Scaling Context Windows via Visual-Text Compression（Glyph：通过视觉-文本压缩扩展上下文窗口长度）

[02:23] 🔍 Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation（面向通用检索增强生成的混合模态检索研究）

[03:10] 🔗 When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling（何时集成：定位Token级位置实现稳定高效的大模型集成）

[04:09] 🎯 Annotation-Efficient Universal Honesty Alignment（注释高效型通用诚实对齐）

[04:49] 🖌 Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback（Uniworld-V2：借助扩散负感知微调与MLLM隐式反馈强化图像编辑）

[05:46] 👁 RL makes MLLMs see better than SFT（强化学习让多模态大模型看得比监督微调更清楚）

[06:33] 🚀 Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling（视觉自回归模型在推理时扩展上击败扩散模型）

[07:09] 🎨 ConsistEdit: Highly Consistent and Precise Training-free Visual Editing（ConsistEdit：面向MM-DiT的高一致免训练视觉编辑）

[07:56] 🔄 Deep Self-Evolving Reasoning（深度自演化推理）

[08:22] 🧠 Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI（超越流水线：模型原生智能体AI范式转移综述）

[09:07] 🔮 Chronos-2: From Univariate to Universal Forecasting（Chronos-2：从单变量到通用预测）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

More shows like HuggingFace 每日AI论文速递

硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文，败类 by 思文败类

思文，败类

156 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners