January 30, 2025

2025.01.30 | Critique Improves Reasoning, AI Energy Consumption Draws Attention

4 minutes

The 5 papers in this episode are:

[00:25] 🧠 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

[01:10] 🌍 Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

[01:50] 🌟 Atla Selene Mini: A General Purpose Evaluation Model

[02:27] ⚠ Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation

[03:06] 🦠 Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

[Follow Us]

You can also find us on the following platforms for more information beyond the podcast content

Xiaohongshu (小红书): AI速递

By duan

22 ratings
