December 31, 2024

2024.12.31 每日AI论文 | 解释性指令提升视觉任务泛化，多模态模型优化医学影像泛化。

Listen Later

7 minutes

本期的 10 篇论文如下：

[00:25] 🔍 Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization（解释性指令：迈向统一视觉任务理解与零样本泛化）

[01:13] 🧠 On the Compositional Generalization of Multimodal LLMs for Medical Imaging（多模态大语言模型在医学影像中的组合泛化研究）

[02:02] ⚙ Efficiently Serving LLM Reasoning Programs with Certaindex（高效服务LLM推理程序的Certaindex系统）

[02:44] 🎨 Edicho: Consistent Image Editing in the Wild（Edicho：在野外图像中的一致性编辑）

[03:22] 🎵 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization（TangoFlux：基于流匹配和CLAP排序偏好优化的超快速且忠实文本到音频生成）

[04:04] 🎥 Bringing Objects to Life: 4D generation from 3D objects（赋予物体生命：从3D物体生成4D内容）

[04:47] 🧠 Facilitating large language model Russian adaptation with Learned Embedding Propagation（通过学习嵌入传播促进大语言模型的俄语适应）

[05:25] 🤖 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation（HumanEval Pro与MBPP Pro：评估大语言模型在自调用代码生成上的表现）

[06:12] 🤖 Training Software Engineering Agents and Verifiers with SWE-Gym（使用SWE-Gym训练软件工程代理与验证器）

[06:52] 🧠 OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System（OneKE：基于Docker化模式引导的LLM代理知识提取系统）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

HuggingFace 每日AI论文速递

By duan

5

22 ratings

December 31, 2024

2024.12.31 每日AI论文 | 解释性指令提升视觉任务泛化，多模态模型优化医学影像泛化。

Listen Later

7 minutes

本期的 10 篇论文如下：

[00:25] 🔍 Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization（解释性指令：迈向统一视觉任务理解与零样本泛化）

[01:13] 🧠 On the Compositional Generalization of Multimodal LLMs for Medical Imaging（多模态大语言模型在医学影像中的组合泛化研究）

[02:02] ⚙ Efficiently Serving LLM Reasoning Programs with Certaindex（高效服务LLM推理程序的Certaindex系统）

[02:44] 🎨 Edicho: Consistent Image Editing in the Wild（Edicho：在野外图像中的一致性编辑）

[03:22] 🎵 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization（TangoFlux：基于流匹配和CLAP排序偏好优化的超快速且忠实文本到音频生成）

[04:04] 🎥 Bringing Objects to Life: 4D generation from 3D objects（赋予物体生命：从3D物体生成4D内容）

[04:47] 🧠 Facilitating large language model Russian adaptation with Learned Embedding Propagation（通过学习嵌入传播促进大语言模型的俄语适应）

[05:25] 🤖 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation（HumanEval Pro与MBPP Pro：评估大语言模型在自调用代码生成上的表现）

[06:12] 🤖 Training Software Engineering Agents and Verifiers with SWE-Gym（使用SWE-Gym训练软件工程代理与验证器）

[06:52] 🧠 OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System（OneKE：基于Docker化模式引导的LLM代理知识提取系统）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

More shows like HuggingFace 每日AI论文速递

硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文，败类 by 思文败类

思文，败类

157 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners