October 02, 2024

2024.10.02 每日AI论文 | 跨能力任务表现受限，边缘设备高效部署模型

9 minutes

本期的 13 篇论文如下：

[00:26] 🔗 Law of the Weakest Link: Cross Capabilities of Large Language Models（最弱环节法则：大型语言模型的跨能力）

[01:05] 🌐 TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices（TPI-LLM：在低资源边缘设备上高效服务70B规模的大型语言模型）

[01:46] 🌍 Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect（Atlas-Chat：为低资源摩洛哥阿拉伯方言定制的大型语言模型）

[02:22] 🎥 One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos（一令分段：视频中的语言指令推理分割）

[02:59] 🌐 Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation（Flex3D：利用灵活的重建模型和输入视图优化进行前馈3D生成）

[03:46] 🎨 Illustrious: an Open Advanced Illustration Model（辉煌：一个开放的高级插画模型）

[04:22] 🚗 SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs（通过3D语义MPIs合成几何控制街景图像）

[05:00] 📸 Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration（后验均值校正流：迈向最小均方误差照片真实图像恢复）

[05:47] 🎨 ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer（遵循扩散变换器的全方位创作者和编辑）

[06:22] 🎥 Visual Context Window Extension: A New Perspective for Long Video Understanding（视觉上下文窗口扩展：长视频理解的新视角）

[07:05] 🤖 Helpful DoggyBot: Open-World Object Fetching using Legged Robots and Vision-Language Models（帮助型DoggyBot：使用四足机器人和视觉语言模型进行开放世界物体抓取）

[07:46] 🎥 DressRecon: Freeform 4D Human Reconstruction from Monocular Video（DressRecon：单目视频中的自由形式4D人体重建）

[08:32] 🤖 What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study（性别偏见的影响？通过以人为本的研究量化机器翻译中的性别偏见）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

View all episodes

By duan

22 ratings