
Sign up to save your podcasts
Or
本期的 11 篇论文如下:
[00:24] 📊 The Lessons of Developing Process Reward Models in Mathematical Reasoning(数学推理中过程奖励模型开发的经验教训)
[01:10] 🧠 Tensor Product Attention Is All You Need(张量积注意力机制是关键)
[01:53] 🤖 $\text{Transformer}^2$: Self-adaptive LLMs(Transformer²:自适应大型语言模型)
[02:34] 🎥 VideoAuteur: Towards Long Narrative Video Generation(视频导演:面向长篇叙事视频生成)
[03:22] 🌐 WebWalker: Benchmarking LLMs in Web Traversal(WebWalker:在网页遍历中评估大语言模型)
[04:08] 🩺 O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning(O1复现之旅 -- 第三部分:医疗推理的推理时间扩展)
[04:50] 🗣 MinMo: A Multimodal Large Language Model for Seamless Voice Interaction(MinMo:一种用于无缝语音交互的多模态大型语言模型)
[05:41] 🔧 SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training(SPAM:带动量重置的尖峰感知Adam优化器用于稳定LLM训练)
[06:25] 🩺 BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature(BIOMEDICA:一个开放的生物医学图像-文本档案、数据集及从科学文献中衍生出的视觉语言模型)
[07:15] 🧪 ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning(ChemAgent:大型语言模型中自更新库提升化学推理能力)
[07:51] 🌐 UnCommon Objects in 3D(三维中的不常见物体)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
本期的 11 篇论文如下:
[00:24] 📊 The Lessons of Developing Process Reward Models in Mathematical Reasoning(数学推理中过程奖励模型开发的经验教训)
[01:10] 🧠 Tensor Product Attention Is All You Need(张量积注意力机制是关键)
[01:53] 🤖 $\text{Transformer}^2$: Self-adaptive LLMs(Transformer²:自适应大型语言模型)
[02:34] 🎥 VideoAuteur: Towards Long Narrative Video Generation(视频导演:面向长篇叙事视频生成)
[03:22] 🌐 WebWalker: Benchmarking LLMs in Web Traversal(WebWalker:在网页遍历中评估大语言模型)
[04:08] 🩺 O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning(O1复现之旅 -- 第三部分:医疗推理的推理时间扩展)
[04:50] 🗣 MinMo: A Multimodal Large Language Model for Seamless Voice Interaction(MinMo:一种用于无缝语音交互的多模态大型语言模型)
[05:41] 🔧 SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training(SPAM:带动量重置的尖峰感知Adam优化器用于稳定LLM训练)
[06:25] 🩺 BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature(BIOMEDICA:一个开放的生物医学图像-文本档案、数据集及从科学文献中衍生出的视觉语言模型)
[07:15] 🧪 ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning(ChemAgent:大型语言模型中自更新库提升化学推理能力)
[07:51] 🌐 UnCommon Objects in 3D(三维中的不常见物体)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递