HuggingFace 每日AI论文速递 (HuggingFace Daily AI Paper Digest)

【Weekend Special】The hottest AI papers of the second week of February | How a 1B LLM can surpass a 405B LLM; a long-context QA benchmark for finance



The 5 papers in this episode:

[00:54] TOP1 (🔥121) | 🤔 Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

[03:41] TOP2 (🔥119) | 🚀 InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

[06:11] TOP3 (🔥117) | 💼 Expect the Unexpected: FailSafe Long Context QA for Finance

[08:23] TOP4 (🔥104) | 🦜 The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

[10:40] TOP5 (🔥100) | 🧠 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

【Follow Us】

You can also find us on the following platform for more content beyond the podcast:

Xiaohongshu (RED): AI速递


HuggingFace 每日AI论文速递 · By duan