January 13, 2025

2025.01.13 | OmniManip achieves general robotic manipulation; VideoRAG improves video retrieval-augmented generation performance.

7 minutes

The 10 papers in this episode are:

[00:24] 🤖 OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

[01:02] 🎥 VideoRAG: Retrieval-Augmented Generation over Video Corpus

[01:38] 🎥 OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

[02:26] 🧠 LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

[03:01] 🧠 Enabling Scalable Oversight via Self-Evolving Critic

[03:34] 🎥 ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning

[04:09] 🎥 Multi-subject Open-set Personalization in Video Generation

[04:47] 🔍 ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

[05:23] 🤖 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

[06:00] 🦠 Infecting Generative AI With Viruses

[Follow Us]

You can also find us on the following platform for more information beyond the podcast content:

Xiaohongshu (小红书): AI速递

HuggingFace 每日AI论文速递

By duan
