The latest SWE-bench results show AI coding agents hitting 79% accuracy, nearly matching human engineers. But is this real progress or just sophisticated memorization? We explore the hidden role of agent scaffolds, the shocking cost differences between models, and why harder benchmarks reveal a 40-point performance drop.

The 79% AI Coder: Genius or Just Memorization?

A man, a sloth, and a donkey collaborate to create a podcast (with a little help from AI). No question is too obscure, no rabbit hole too deep. My Weird Prompts celebrates curiosity in all its forms. Daniel, the human, asks the questions that pop into his head at inconvenient moments. Corn the Sloth offers laid-back, thoughtful takes. Herman the Donkey brings boundless enthusiasm and energy. Together, they explore topics ranging from the mundane to the mind-bending. Each episode begins with a real voice memo from Daniel, processed through an AI pipeline that generates scripts, synthesizes voices, and assembles the final podcast. Stay curious.

Share The 79% AI Coder: Genius or Just Memorization?

Sign up to save your podcasts

The 79% AI Coder: Genius or Just Memorization?

The 79% AI Coder: Genius or Just Memorization?