My Weird Prompts

The 79% AI Coder: Genius or Just Memorization?


Listen Later

The latest SWE-bench results show AI coding agents hitting 79% accuracy, nearly matching human engineers. But is this real progress or just sophisticated memorization? We explore the hidden role of agent scaffolds, the shocking cost differences between models, and why harder benchmarks reveal a 40-point performance drop.
...more
View all episodesView all episodes
Download on the App Store

My Weird PromptsBy Daniel Rosehill