Code[ish]

I Was There: Stories of Production Incidents


Listen Later

Corey Martin values storytelling. It's just one way developers can share their experiences in order for others to take lessons. To that end, this episode takes a close look at production issues from two different applications to examine what went wrong and how it was fixed.

Meg Viar is a Senior Software Developer at Nomadic Learning, an e-learning platform. One day, they noticed that, for a certain group of users, a column of information in their database row was nulled. It didn't look like any user--either internally or externally--intentionally changed these values, and there hadn't been any new code deployed in days. The only clue was that the data was all changed at the same time. It turned out that a weekly cron job was deleting some data on an in-memory list. However, the database ORM they use also overloads the delete keyword, and was actually deleting the production data. Restoring the data from a backup was easy, and reworking the code to not use the data was a quick fix. However, going forward, Meg and her team came up with several ways to adjust the process around code changes like this from occurring again.

Brendan Hennessy is the co-founder and CTO at Launchpad Lab, a studio that builds custom web and mobile applications. One of their clients is an SAT/ACT test prep app, and students complained that the app was extraordinarily slow. Brendan was accustomed to seeing such feedback on testing days, when heavy volume brought added strain to servers, and they accounted for this by increasing capacity. But this was different: there weren't any tests scheduled during the period. Instead, one of their own services was inadvertently DDOSing an endpoint, expecting a response; when one didn't arrive, it just kept making requests. They reworked this code to make a request once and simply wait for a response without trying again. In the future, they committed themselves to doing more in-person blitzes of new features, since issues like this only arise after multiple users use the app--something automated tests have trouble simulating.

Links from this episode
  • Nomadic Learning builds digital academies
  • Launchpad Lab builds custom web and mobile applications for startups and established businesses
...more
View all episodesView all episodes
Download on the App Store

Code[ish]By Heroku from Salesforce

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

18 ratings


More shows like Code[ish]

View all
TED Radio Hour by NPR

TED Radio Hour

22,003 Listeners

Planet Money by NPR

Planet Money

30,670 Listeners

Global News Podcast by BBC World Service

Global News Podcast

7,681 Listeners

Economist Podcasts by The Economist

Economist Podcasts

4,178 Listeners

This Week in Startups by Jason Calacanis

This Week in Startups

1,283 Listeners

Accidental Tech Podcast by Marco Arment, Casey Liss, John Siracusa

Accidental Tech Podcast

2,126 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Science Vs by Spotify Studios

Science Vs

12,196 Listeners

The Salesforce Admins Podcast by Mike Gerholdt

The Salesforce Admins Podcast

205 Listeners

The Daily by The New York Times

The Daily

112,454 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

987 Listeners

The Diary Of A CEO with Steven Bartlett by DOAC

The Diary Of A CEO with Steven Bartlett

8,434 Listeners

Darknet Diaries by Jack Rhysider

Darknet Diaries

7,999 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,829 Listeners

Plain English with Derek Thompson by The Ringer

Plain English with Derek Thompson

2,286 Listeners