
Sign up to save your podcasts
Or


Three Chinese AI labs just released models that are rewriting the leaderboards. Moonshot AI's Kimi K2.5 can spin up a hundred agents working in parallel and scored 74.9% on BrowseComp, seventeen points ahead of GPT-5.2. Alibaba's Qwen3-Max-Thinking hit 58.3 on Humanity's Last Exam with perfect scores on AIME 2025. And Zhipu AI's GLM-5 matches Claude Opus 4.6 on SWE-bench Verified at a fraction of the cost. All three are open source. We break down what each one does, why it matters, and what it means for developers and builders.
Sources: Moonshot AI (kimi.com), Alibaba Qwen (huggingface.co/Qwen), Zhipu AI (zhipuai.cn), TechCrunch, InfoQ, RAND Corporation.
By Pallav TyagiThree Chinese AI labs just released models that are rewriting the leaderboards. Moonshot AI's Kimi K2.5 can spin up a hundred agents working in parallel and scored 74.9% on BrowseComp, seventeen points ahead of GPT-5.2. Alibaba's Qwen3-Max-Thinking hit 58.3 on Humanity's Last Exam with perfect scores on AIME 2025. And Zhipu AI's GLM-5 matches Claude Opus 4.6 on SWE-bench Verified at a fraction of the cost. All three are open source. We break down what each one does, why it matters, and what it means for developers and builders.
Sources: Moonshot AI (kimi.com), Alibaba Qwen (huggingface.co/Qwen), Zhipu AI (zhipuai.cn), TechCrunch, InfoQ, RAND Corporation.