HuggingFace 每日AI论文速递

2024.10.15 每日AI论文 | MMIE推动LVLMs发展,LOKI评估合成数据检测。


Listen Later

本期的 15 篇论文如下:

[00:24] 🌐 MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models(大规模多模态交错理解基准测试)

[01:06] 🤖 LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models(LOKI:基于大型多模态模型的综合合成数据检测基准)

[02:01] 🔍 Toward General Instruction-Following Alignment for Retrieval-Augmented Generation(面向检索增强生成的通用指令遵循对齐)

[02:36] 📊 MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks(MEGA-Bench:将多模态评估扩展到500多个真实世界任务)

[03:12] 🎥 Animate-X: Universal Character Image Animation with Enhanced Motion Representation(Animate-X:增强运动表示的通用角色图像动画)

[04:02] 📚 Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models(全能数学:面向大型语言模型的奥林匹克级数学基准)

[04:44] 📚 LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content(LiveXiv -- 基于Arxiv论文内容的多模态实时基准)

[05:29] 🎥 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention(Cavia:具有视角控制的多视角视频扩散与视角集成注意力)

[06:09] ⏳ TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models(时间轴基准:多模态视频模型细粒度时间理解评测)

[06:58] 🌊 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations(基于校正随机微分方程的语义图像反演与编辑)

[07:40] 📊 Rethinking Data Selection at Scale: Random Selection is Almost All You Need(重新思考大规模数据选择:随机选择几乎是你所需要的)

[08:26] 🌲 Tree of Problems: Improving structured problem solving with compositionality(问题树:通过组合性改进结构化问题解决)

[09:13] 📺 TVBench: Redesigning Video-Language Evaluation(TVBench:重塑视频语言评估)

[09:54] 🤖 Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies(可泛化的人形机器人操作:改进的三维扩散策略)

[10:29] 📚 LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory(长时记忆评估:在长期交互记忆中评估聊天助手)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan