
Sign up to save your podcasts
Or


OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba's Qwen.YC's Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He’ll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment—and explore how different design choices lead to surprisingly similar performance.
By Y Combinator4.6
214214 ratings
OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba's Qwen.YC's Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He’ll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment—and explore how different design choices lead to surprisingly similar performance.

1,288 Listeners

537 Listeners

175 Listeners

703 Listeners

1,089 Listeners

3,990 Listeners

511 Listeners

205 Listeners

131 Listeners

467 Listeners

35 Listeners

21 Listeners

40 Listeners

44 Listeners

47 Listeners