
Sign up to save your podcasts
Or
本期的 15 篇论文如下:
[00:21] 🎬 Paper2Video: Automatic Video Generation from Scientific Papers(论文自动生成学术演讲视频)
[00:55] 🎬 Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models(Video-LMM后训练:深入剖析大型多模态模型的视频推理)
[01:38] 🎬 VChain: Chain-of-Visual-Thought for Reasoning in Video Generation(VChain:面向视频生成推理的视觉思维链)
[02:14] 👻 Imperceptible Jailbreaking against Large Language Models(针对大语言模型的隐形越狱攻击)
[02:56] 🌳 MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information(MITS:基于点互信息的树搜索增强大模型推理)
[03:30] 🧬 Hybrid Architectures for Language Models: Systematic Analysis and Design Insights(语言模型混合架构:系统剖析与设计洞见)
[04:07] 📊 Factuality Matters: When Image Generation and Editing Meet Structured Visuals(事实至关重要:当图像生成与编辑遇上结构化视觉)
[04:59] 🔄 Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models(反应式Transformer:事件驱动的实时有状态对话模型)
[05:55] ⚖ Judging with Confidence: Calibrating Autoraters to Preference Distributions(置信评判:将自动评分器校准到偏好分布)
[06:44] 🎯 Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training(Reinforce-Ada:面向Reinforce风格LLM训练的自适应采样框架)
[07:27] 📏 Optimal Scaling Needs Optimal Norm(最优扩放需要最优范数)
[07:51] 🔬 Code4MeV2: a Research-oriented Code-completion Platform(Code4MeV2:面向研究的代码补全平台)
[08:31] 🪞 Self-Reflective Generation at Test Time(测试时自反思生成)
[09:15] 🔄 SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs(SwiReasoning:在显式与潜空间之间切换思维,实现帕累托更优的推理大模型)
[10:00] 👀 Watch and Learn: Learning to Use Computers from Online Videos(观看与学习:从在线视频中学习使用计算机)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
本期的 15 篇论文如下:
[00:21] 🎬 Paper2Video: Automatic Video Generation from Scientific Papers(论文自动生成学术演讲视频)
[00:55] 🎬 Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models(Video-LMM后训练:深入剖析大型多模态模型的视频推理)
[01:38] 🎬 VChain: Chain-of-Visual-Thought for Reasoning in Video Generation(VChain:面向视频生成推理的视觉思维链)
[02:14] 👻 Imperceptible Jailbreaking against Large Language Models(针对大语言模型的隐形越狱攻击)
[02:56] 🌳 MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information(MITS:基于点互信息的树搜索增强大模型推理)
[03:30] 🧬 Hybrid Architectures for Language Models: Systematic Analysis and Design Insights(语言模型混合架构:系统剖析与设计洞见)
[04:07] 📊 Factuality Matters: When Image Generation and Editing Meet Structured Visuals(事实至关重要:当图像生成与编辑遇上结构化视觉)
[04:59] 🔄 Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models(反应式Transformer:事件驱动的实时有状态对话模型)
[05:55] ⚖ Judging with Confidence: Calibrating Autoraters to Preference Distributions(置信评判:将自动评分器校准到偏好分布)
[06:44] 🎯 Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training(Reinforce-Ada:面向Reinforce风格LLM训练的自适应采样框架)
[07:27] 📏 Optimal Scaling Needs Optimal Norm(最优扩放需要最优范数)
[07:51] 🔬 Code4MeV2: a Research-oriented Code-completion Platform(Code4MeV2:面向研究的代码补全平台)
[08:31] 🪞 Self-Reflective Generation at Test Time(测试时自反思生成)
[09:15] 🔄 SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs(SwiReasoning:在显式与潜空间之间切换思维,实现帕累托更优的推理大模型)
[10:00] 👀 Watch and Learn: Learning to Use Computers from Online Videos(观看与学习:从在线视频中学习使用计算机)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递