
Sign up to save your podcasts
Or
本期的 13 篇论文如下:
[00:26] 🔗 Law of the Weakest Link: Cross Capabilities of Large Language Models(最弱环节法则:大型语言模型的跨能力)
[01:05] 🌐 TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices(TPI-LLM:在低资源边缘设备上高效服务70B规模的大型语言模型)
[01:46] 🌍 Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect(Atlas-Chat:为低资源摩洛哥阿拉伯方言定制的大型语言模型)
[02:22] 🎥 One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos(一令分段:视频中的语言指令推理分割)
[02:59] 🌐 Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation(Flex3D:利用灵活的重建模型和输入视图优化进行前馈3D生成)
[03:46] 🎨 Illustrious: an Open Advanced Illustration Model(辉煌:一个开放的高级插画模型)
[04:22] 🚗 SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs(通过3D语义MPIs合成几何控制街景图像)
[05:00] 📸 Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration(后验均值校正流:迈向最小均方误差照片真实图像恢复)
[05:47] 🎨 ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer(遵循扩散变换器的全方位创作者和编辑)
[06:22] 🎥 Visual Context Window Extension: A New Perspective for Long Video Understanding(视觉上下文窗口扩展:长视频理解的新视角)
[07:05] 🤖 Helpful DoggyBot: Open-World Object Fetching using Legged Robots and Vision-Language Models(帮助型DoggyBot:使用四足机器人和视觉语言模型进行开放世界物体抓取)
[07:46] 🎥 DressRecon: Freeform 4D Human Reconstruction from Monocular Video(DressRecon:单目视频中的自由形式4D人体重建)
[08:32] 🤖 What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study(性别偏见的影响?通过以人为本的研究量化机器翻译中的性别偏见)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
本期的 13 篇论文如下:
[00:26] 🔗 Law of the Weakest Link: Cross Capabilities of Large Language Models(最弱环节法则:大型语言模型的跨能力)
[01:05] 🌐 TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices(TPI-LLM:在低资源边缘设备上高效服务70B规模的大型语言模型)
[01:46] 🌍 Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect(Atlas-Chat:为低资源摩洛哥阿拉伯方言定制的大型语言模型)
[02:22] 🎥 One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos(一令分段:视频中的语言指令推理分割)
[02:59] 🌐 Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation(Flex3D:利用灵活的重建模型和输入视图优化进行前馈3D生成)
[03:46] 🎨 Illustrious: an Open Advanced Illustration Model(辉煌:一个开放的高级插画模型)
[04:22] 🚗 SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs(通过3D语义MPIs合成几何控制街景图像)
[05:00] 📸 Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration(后验均值校正流:迈向最小均方误差照片真实图像恢复)
[05:47] 🎨 ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer(遵循扩散变换器的全方位创作者和编辑)
[06:22] 🎥 Visual Context Window Extension: A New Perspective for Long Video Understanding(视觉上下文窗口扩展:长视频理解的新视角)
[07:05] 🤖 Helpful DoggyBot: Open-World Object Fetching using Legged Robots and Vision-Language Models(帮助型DoggyBot:使用四足机器人和视觉语言模型进行开放世界物体抓取)
[07:46] 🎥 DressRecon: Freeform 4D Human Reconstruction from Monocular Video(DressRecon:单目视频中的自由形式4D人体重建)
[08:32] 🤖 What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study(性别偏见的影响?通过以人为本的研究量化机器翻译中的性别偏见)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递