November 06, 2024

2024.11.06 每日AI论文 | HTML提升RAG性能，分子图助手优化多模态任务

8 minutes

本期的 11 篇论文如下：

[00:30] 📄 HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems（HtmlRAG：在RAG系统中，HTML比纯文本更适合建模检索知识）

[01:12] 🧬 LLaMo: Large Language Model-based Molecular Graph Assistant（基于大型语言模型的分子图助手）

[01:52] 🤖 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution（DeeR-VLA：动态推理多模态大语言模型以实现高效机器人执行）

[02:28] 🤖 Sample-Efficient Alignment for LLMs（LLM的高效对齐方法）

[03:01] 🚦 Controlling Language and Diffusion Models by Transporting Activations（通过传输激活控制语言和扩散模型）

[03:49] 🌟 DreamPolish: Domain Score Distillation With Progressive Geometry Generation（梦幻抛光：基于渐进几何生成的领域分数蒸馏）

[04:32] 🦓 Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge（斑马-羊驼：一种用于普及罕见病知识的上下文感知大型语言模型）

[05:12] 👕 GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details（GarVerseLOD：利用多层次细节数据集从单张自然图像中进行高保真3D服装重建）

[05:46] 🔍 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation（目标检测性能与视觉显著性和深度估计的相关性）

[06:28] 🔄 Adaptive Length Image Tokenization via Recurrent Allocation（通过递归分配实现自适应长度图像标记化）

[07:01] 🧠 Inference Optimal VLMs Need Only One Visual Token but Larger Models（推断最优的视觉语言模型仅需一个视觉标记但需要更大的模型）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

View all episodes

By duan

22 ratings

November 06, 2024

2024.11.06 每日AI论文 | HTML提升RAG性能，分子图助手优化多模态任务

8 minutes

本期的 11 篇论文如下：

[00:30] 📄 HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems（HtmlRAG：在RAG系统中，HTML比纯文本更适合建模检索知识）

[01:12] 🧬 LLaMo: Large Language Model-based Molecular Graph Assistant（基于大型语言模型的分子图助手）

[01:52] 🤖 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution（DeeR-VLA：动态推理多模态大语言模型以实现高效机器人执行）

[02:28] 🤖 Sample-Efficient Alignment for LLMs（LLM的高效对齐方法）

[03:01] 🚦 Controlling Language and Diffusion Models by Transporting Activations（通过传输激活控制语言和扩散模型）

[03:49] 🌟 DreamPolish: Domain Score Distillation With Progressive Geometry Generation（梦幻抛光：基于渐进几何生成的领域分数蒸馏）

[04:32] 🦓 Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge（斑马-羊驼：一种用于普及罕见病知识的上下文感知大型语言模型）

[05:46] 🔍 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation（目标检测性能与视觉显著性和深度估计的相关性）

[06:28] 🔄 Adaptive Length Image Tokenization via Recurrent Allocation（通过递归分配实现自适应长度图像标记化）

[07:01] 🧠 Inference Optimal VLMs Need Only One Visual Token but Larger Models（推断最优的视觉语言模型仅需一个视觉标记但需要更大的模型）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

...more

More shows like HuggingFace 每日AI论文速递

View all

硅谷101|中国版

56 Listeners

商业就是这样

291 Listeners

声动早咖啡

294 Listeners

思文，败类

157 Listeners

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活

0 Listeners

Share 2024.11.06 每日AI论文 | HTML提升RAG性能，分子图助手优化多模态任务

Sign up to save your podcasts

2024.11.06 每日AI论文 | HTML提升RAG性能，分子图助手优化多模态任务

2024.11.06 每日AI论文 | HTML提升RAG性能，分子图助手优化多模态任务

More shows like HuggingFace 每日AI论文速递

硅谷101|中国版

商业就是这样

声动早咖啡

思文，败类

不开玩笑 Jokes Aside

人民公园说AI

數創實驗室 - AI時代的學習指南

AI可可AI生活