HuggingFace 每日AI论文速递

2025.04.29 | RepText提升多语言文本渲染;LLM改进手机GUI自动化。


Listen Later

本期的 11 篇论文如下:

[00:23] ✍ RepText: Rendering Visual Text via Replicating(RepText:通过复制渲染视觉文本)

[01:02] 📱 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects(LLM驱动的手机GUI代理:进展与展望)

[01:44] 🔐 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges(CipherBank:通过密码学挑战探索大型语言模型推理能力的边界)

[02:30] 🤔 Clinical knowledge in LLMs does not translate to human interactions(大型语言模型中的临床知识未能转化为人际互动)

[03:16] ⬇ Group Downsampling with Equivariant Anti-aliasing(群等变抗锯齿降采样)

[03:59] 📐 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving(TrustGeoGen:用于可信多模态几何问题求解的可扩展且形式验证的数据引擎)

[04:39] 🤖 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning(SPC:通过对抗博弈演进自博弈评论器以提升大型语言模型推理能力)

[05:30] 🖼 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency(基于显式视觉依赖的多模态数学推理能力基准测试)

[06:15] 🚀 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention(MMInference:通过模态感知置换稀疏注意力加速长文本VLM的预填充)

[06:49] 🔑 ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers(ICL密码:通过替换密码量化上下文学习中的“学习”)

[07:30] 💡 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development(ChiseLLM:释放推理LLM在Chisel敏捷硬件开发中的力量)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

...more
View all episodesView all episodes
Download on the App Store

HuggingFace 每日AI论文速递By duan

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like HuggingFace 每日AI论文速递

View all
硅谷101|中国版 by 泓君Jane

硅谷101|中国版

56 Listeners

商业就是这样 by 商业就是这样

商业就是这样

292 Listeners

声动早咖啡 by 声动活泼

声动早咖啡

293 Listeners

思文,败类 by 思文败类

思文,败类

156 Listeners

不开玩笑 Jokes Aside by 不开玩笑JokesAside

不开玩笑 Jokes Aside

136 Listeners

人民公园说AI by JustSayAI

人民公园说AI

7 Listeners

數創實驗室 - AI時代的學習指南 by Vincent在數創

數創實驗室 - AI時代的學習指南

1 Listeners

AI可可AI生活 by fly51fly

AI可可AI生活

0 Listeners