HuggingFace 每日AI论文速递

2025.03.10 | 多模态任务新框架,俄语ICD编码提升。


Listen Later

本期的 20 篇论文如下:

[00:19] 🌐 Unified Reward Model for Multimodal Understanding and Generation(多模态理解和生成的统一奖励模型)

[01:04] 🇷 RuCCoD: Towards Automated ICD Coding in Russian(RuCCoD:面向俄语自动化的ICD编码研究)

[01:41] 🌍 EuroBERT: Scaling Multilingual Encoders for European Languages(EuroBERT:扩展欧洲语言的多语言编码器)

[02:28] 🗣 S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information(S2S-Arena:评估语音到语音协议在指令跟随中的副语言信息)

[03:08] 🧠 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching(思维草图:结合认知启发草图的高效LLM推理)

[03:47] 🧠 Forgetting Transformer: Softmax Attention with a Forget Gate(遗忘Transformer:带遗忘门的Softmax注意力机制)

[04:28] 🧠 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning(R1-Searcher:通过强化学习激励LLMs的搜索能力)

[05:19] 🎥 VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control(VideoPainter:任意长度视频修复与编辑的即插即用上下文控制)

[06:04] 🎭 R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning(R1-Omni:基于强化学习的可解释全模态情感识别)

[06:50] 🎥 TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models(TrajectoryCrafter:通过扩散模型重定向单目视频的相机轨迹)

[07:26] 🌊 ProReflow: Progressive Reflow with Decomposed Velocity(ProReflow:渐进式重流与分解速度)

[08:11] 🤖 BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities(BEHAVIOR机器人套件:简化日常家庭活动的全身操作)

[08:50] 🧠 An Empirical Study on Eliciting and Improving R1-like Reasoning Models(关于启发和提升类似R1推理模型的实证研究)

[09:27] 🧠 Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts(线性-专家混合模型:线性序列建模与专家混合模型的结合)

[10:13] 🧠 TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation(TinyR1-32B-Preview:通过分支-合并蒸馏提升准确性)

[10:56] 🧑 LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding(LONGCODEU:评估长上下文语言模型在长代码理解中的表现)

[11:41] 🔄 Learning from Failures in Multi-Attempt Reinforcement Learning(从失败中学习:多尝试强化学习)

[12:20] 🔍 SAGE: A Framework of Precise Retrieval for RAG(SAGE:RAG精准检索框架)

[13:01] 🧠 R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model(R1-Zero在2B非SFT模型上的视觉推理中的“顿悟时刻”)

[13:39] 🤖 Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles(初次了解你并更好地成为你:通过隐式用户画像建模人类对话模拟器)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文,败类 by 思文败类

思文,败类

157 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners