HuggingFace 每日AI论文速递

2024.11.06 每日AI论文 | HTML提升RAG性能,分子图助手优化多模态任务


Listen Later

本期的 11 篇论文如下:

[00:30] 📄 HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems(HtmlRAG:在RAG系统中,HTML比纯文本更适合建模检索知识)

[01:12] 🧬 LLaMo: Large Language Model-based Molecular Graph Assistant(基于大型语言模型的分子图助手)

[01:52] 🤖 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution(DeeR-VLA:动态推理多模态大语言模型以实现高效机器人执行)

[02:28] 🤖 Sample-Efficient Alignment for LLMs(LLM的高效对齐方法)

[03:01] 🚦 Controlling Language and Diffusion Models by Transporting Activations(通过传输激活控制语言和扩散模型)

[03:49] 🌟 DreamPolish: Domain Score Distillation With Progressive Geometry Generation(梦幻抛光:基于渐进几何生成的领域分数蒸馏)

[04:32] 🦓 Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge(斑马-羊驼:一种用于普及罕见病知识的上下文感知大型语言模型)

[05:12] 👕 GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details(GarVerseLOD:利用多层次细节数据集从单张自然图像中进行高保真3D服装重建)

[05:46] 🔍 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation(目标检测性能与视觉显著性和深度估计的相关性)

[06:28] 🔄 Adaptive Length Image Tokenization via Recurrent Allocation(通过递归分配实现自适应长度图像标记化)

[07:01] 🧠 Inference Optimal VLMs Need Only One Visual Token but Larger Models(推断最优的视觉语言模型仅需一个视觉标记但需要更大的模型)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

291 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

294 Listeners

思文,败类 by 思文败类

思文,败类

157 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners