Ship AI

Episode 1: GPT-5, Opus 4.1, GPT-OSS-2B, and more!


Listen Later

In this first episode, Manav Gupta and Mihai Criveti put the latest AI models through their paces in a head-to-head coding challenge. Watch as Claude Opus 4.1, GPT-5, and the open-source GPT OSS 20 billion compete to build interactive games and applications from simple prompts.

Highlights:

  • Live coding challenges including Snake, Minesweeper, and a Prince of Persia clone
  • Real-time comparison of how each model handles game development, from basic functionality to "kawaii" styling
  • Testing complex technical tasks like creating IBM mainframe architecture diagrams
  • Classic AI benchmark tests (counting letters, arithmetic problems) with surprising results
  • Mihai runs GPT OSS locally on his own GPU, showcasing impressive open-source capabilities

Key Takeaways:

  • Claude Opus 4.1 emerges as the overall winner with cleaner interfaces and superior artifact management
  • GPT-5 shows promise but struggles with canvas implementation
  • Open-source models are rapidly closing the gap with commercial offerings
  • Discussion on how these tools are reshaping the future of software development
...more
View all episodesView all episodes
Download on the App Store

Ship AIBy Manav Gupta