HuggingFace 每日AI论文速递

2025.01.30 | 批评提升推理,AI能耗引关注


Listen Later

本期的 5 篇论文如下:

[00:25] 🧠 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate(批评微调:学习批评比学习模仿更有效)

[01:10] 🌍 Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts(探索AI可持续扩展的困境:企业AI环境影响的预测性研究)

[01:50] 🌟 Atla Selene Mini: A General Purpose Evaluation Model(Atla Selene Mini:一种通用评估模型)

[02:27] ⚠ Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation(OpenAI的o3-mini早期外部安全测试:部署前评估的见解)

[03:06] 🦠 Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation(病毒:绕过防护机制的大语言模型有害微调攻击)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan