


Why do major retailers with unlimited budgets still crash on Black Friday? This episode dives into the graveyard of e-commerce outages, from J.Crew's $775,000 five-hour crash to the AWS typo that cost $150 million.
In this Black Friday special episode, we examine:
📊 THE HALL OF FAME CRASHES
💥 THE FAMOUS NON-BLACK-FRIDAY DISASTERS
🛡️ THE PLATFORM ENGINEER'S PLAYBOOK
The uncomfortable truth: These outages aren't caused by lack of budget or talent. They're caused by complexity, assumptions, and the gap between "should work" and "actually tested."
🔗 Full transcript & notes: https://platformengineeringplaybook.com/podcasts/00039-black-friday-war-stories
Episode Tags: Black Friday, e-commerce outages, AWS S3, GitLab, Kubernetes, platform engineering, SRE, incident response, chaos engineering, load testing
By vibesre