
The 10 papers in this episode:
[00:25] 🔍 Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization
[01:13] 🧠 On the Compositional Generalization of Multimodal LLMs for Medical Imaging
[02:02] ⚙ Efficiently Serving LLM Reasoning Programs with Certaindex
[02:44] 🎨 Edicho: Consistent Image Editing in the Wild
[03:22] 🎵 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
[04:04] 🎥 Bringing Objects to Life: 4D generation from 3D objects
[04:47] 🧠 Facilitating large language model Russian adaptation with Learned Embedding Propagation
[05:25] 🤖 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
[06:12] 🤖 Training Software Engineering Agents and Verifiers with SWE-Gym
[06:52] 🧠 OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System
[Follow Us]
You can also find us on the following platform for more beyond the podcast:
Xiaohongshu: AI速递