
Sign up to save your podcasts
Or


本期的 6 篇论文如下:
[00:19] 🎬 Video Generation Models Are Good Latent Reward Models(视频生成模型是优秀的潜在奖励模型)
[01:07] 🎨 Canvas-to-Image: Compositional Image Generation with Multimodal Controls(画布到图像:基于多模态控制的组合式图像生成)
[01:49] 🎨 MIRA: Multimodal Iterative Reasoning Agent for Image Editing(MIRA:多模态迭代推理代理用于图像编辑)
[02:30] 📊 Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following(多准则:多模态评估器在多元化标准遵循上的基准测试)
[03:12] 🧠 What does it mean to understand language?(理解语言意味着什么?)
[03:47] 🧠 Agentic Learner with Grow-and-Refine Multimodal Semantic Memory(具有生长与精炼多模态语义记忆的自主学习者)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
By duan5
22 ratings
本期的 6 篇论文如下:
[00:19] 🎬 Video Generation Models Are Good Latent Reward Models(视频生成模型是优秀的潜在奖励模型)
[01:07] 🎨 Canvas-to-Image: Compositional Image Generation with Multimodal Controls(画布到图像:基于多模态控制的组合式图像生成)
[01:49] 🎨 MIRA: Multimodal Iterative Reasoning Agent for Image Editing(MIRA:多模态迭代推理代理用于图像编辑)
[02:30] 📊 Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following(多准则:多模态评估器在多元化标准遵循上的基准测试)
[03:12] 🧠 What does it mean to understand language?(理解语言意味着什么?)
[03:47] 🧠 Agentic Learner with Grow-and-Refine Multimodal Semantic Memory(具有生长与精炼多模态语义记忆的自主学习者)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

56 Listeners

292 Listeners

293 Listeners

157 Listeners

136 Listeners

7 Listeners

1 Listeners

0 Listeners