HuggingFace Daily AI Paper Digest

2024.09.23 Daily AI Papers | Tuning-Free Personalized Image Generation, Multimodal Satire Comprehension Evaluation



The 11 papers in this episode are:

[00:26] 🎨 Imagine yourself: Tuning-Free Personalized Image Generation

[01:02] 😂 YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

[01:40] 🌍 Prithvi WxC: Foundation Model for Weather and Climate

[02:15] 🎵 MuCodec: Ultra Low-Bitrate Music Codec

[02:51] 🌈 Colorful Diffuse Intrinsic Image Decomposition in the Wild

[03:29] 🎥 Portrait Video Editing Empowered by Multimodal Generative Priors

[04:01] 🎥 Temporally Aligned Audio for Video with Autoregression

[04:38] 📱 V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians

[05:21] 📚 Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

[05:57] 🛡 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments

[06:34] 🎻 Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts

[Follow Us]

You can also find us on the following platform, where we share more than what is covered in the podcast.

Xiaohongshu (小红书): AI速递


HuggingFace Daily AI Paper Digest, by duan