October 15, 2024

2024.10.15 每日AI论文 | MMIE推动LVLMs发展，LOKI评估合成数据检测。

Listen Later

11 minutes

本期的 15 篇论文如下：

[00:24] 🌐 MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models（大规模多模态交错理解基准测试）

[01:06] 🤖 LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models（LOKI：基于大型多模态模型的综合合成数据检测基准）

[02:01] 🔍 Toward General Instruction-Following Alignment for Retrieval-Augmented Generation（面向检索增强生成的通用指令遵循对齐）

[02:36] 📊 MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks（MEGA-Bench：将多模态评估扩展到500多个真实世界任务）

[03:12] 🎥 Animate-X: Universal Character Image Animation with Enhanced Motion Representation（Animate-X：增强运动表示的通用角色图像动画）

[04:02] 📚 Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models（全能数学：面向大型语言模型的奥林匹克级数学基准）

[04:44] 📚 LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content（LiveXiv -- 基于Arxiv论文内容的多模态实时基准）

[05:29] 🎥 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention（Cavia：具有视角控制的多视角视频扩散与视角集成注意力）

[06:09] ⏳ TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models（时间轴基准：多模态视频模型细粒度时间理解评测）

[06:58] 🌊 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations（基于校正随机微分方程的语义图像反演与编辑）

[07:40] 📊 Rethinking Data Selection at Scale: Random Selection is Almost All You Need（重新思考大规模数据选择：随机选择几乎是你所需要的）

[08:26] 🌲 Tree of Problems: Improving structured problem solving with compositionality（问题树：通过组合性改进结构化问题解决）

[09:13] 📺 TVBench: Redesigning Video-Language Evaluation（TVBench：重塑视频语言评估）

[09:54] 🤖 Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies（可泛化的人形机器人操作：改进的三维扩散策略）

[10:29] 📚 LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory（长时记忆评估：在长期交互记忆中评估聊天助手）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

HuggingFace 每日AI论文速递

By duan

5

22 ratings

October 15, 2024

2024.10.15 每日AI论文 | MMIE推动LVLMs发展，LOKI评估合成数据检测。

Listen Later

11 minutes

本期的 15 篇论文如下：

[00:24] 🌐 MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models（大规模多模态交错理解基准测试）

[01:06] 🤖 LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models（LOKI：基于大型多模态模型的综合合成数据检测基准）

[02:01] 🔍 Toward General Instruction-Following Alignment for Retrieval-Augmented Generation（面向检索增强生成的通用指令遵循对齐）

[02:36] 📊 MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks（MEGA-Bench：将多模态评估扩展到500多个真实世界任务）

[03:12] 🎥 Animate-X: Universal Character Image Animation with Enhanced Motion Representation（Animate-X：增强运动表示的通用角色图像动画）

[04:02] 📚 Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models（全能数学：面向大型语言模型的奥林匹克级数学基准）

[04:44] 📚 LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content（LiveXiv -- 基于Arxiv论文内容的多模态实时基准）

[05:29] 🎥 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention（Cavia：具有视角控制的多视角视频扩散与视角集成注意力）

[06:09] ⏳ TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models（时间轴基准：多模态视频模型细粒度时间理解评测）

[06:58] 🌊 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations（基于校正随机微分方程的语义图像反演与编辑）

[07:40] 📊 Rethinking Data Selection at Scale: Random Selection is Almost All You Need（重新思考大规模数据选择：随机选择几乎是你所需要的）

[08:26] 🌲 Tree of Problems: Improving structured problem solving with compositionality（问题树：通过组合性改进结构化问题解决）

[09:13] 📺 TVBench: Redesigning Video-Language Evaluation（TVBench：重塑视频语言评估）

[09:54] 🤖 Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies（可泛化的人形机器人操作：改进的三维扩散策略）

[10:29] 📚 LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory（长时记忆评估：在长期交互记忆中评估聊天助手）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

More shows like HuggingFace 每日AI论文速递

硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

291 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

294 Listeners

思文，败类 by 思文败类

思文，败类

157 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners