
Sign up to save your podcasts
Or


OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba's Qwen.YC's Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He’ll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment—and explore how different design choices lead to surprisingly similar performance.
By Y Combinator4.6
215215 ratings
OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba's Qwen.YC's Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He’ll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment—and explore how different design choices lead to surprisingly similar performance.

1,296 Listeners

536 Listeners

1,105 Listeners

2,221 Listeners

341 Listeners

212 Listeners

551 Listeners

300 Listeners

211 Listeners

150 Listeners

101 Listeners

475 Listeners

34 Listeners

21 Listeners

42 Listeners