
Sign up to save your podcasts
Or
本期的 5 篇论文如下:
[00:33] TOP1(🔥108) | 💡 Kuwain 1.5B: An Arabic SLM via Language Injection(Kuwain 1.5B:一种基于语言注入的阿拉伯语SLM)
[02:43] TOP2(🔥98) | 🤔 Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?(强化学习真的能激励大语言模型产生超越基础模型的推理能力吗?)
[04:58] TOP3(🔥78) | 🤖 TTRL: Test-Time Reinforcement Learning(测试时强化学习)
[07:12] TOP4(🔥71) | 💡 Learning to Reason under Off-Policy Guidance(基于离策略指导的学习推理)
[09:12] TOP5(🔥62) | 🦅 Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models(Eagle 2.5:提升前沿视觉-语言模型长文本后训练性能)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
本期的 5 篇论文如下:
[00:33] TOP1(🔥108) | 💡 Kuwain 1.5B: An Arabic SLM via Language Injection(Kuwain 1.5B:一种基于语言注入的阿拉伯语SLM)
[02:43] TOP2(🔥98) | 🤔 Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?(强化学习真的能激励大语言模型产生超越基础模型的推理能力吗?)
[04:58] TOP3(🔥78) | 🤖 TTRL: Test-Time Reinforcement Learning(测试时强化学习)
[07:12] TOP4(🔥71) | 💡 Learning to Reason under Off-Policy Guidance(基于离策略指导的学习推理)
[09:12] TOP5(🔥62) | 🦅 Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models(Eagle 2.5:提升前沿视觉-语言模型长文本后训练性能)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递