Paul Zaich from Checkr tells us about a critical outage, what caused it, and how they tracked down and fixed the issue. The conversation ranges through troubleshooting complex systems, building team culture, blameless post-mortems, and monitoring the right things so that your applications either don't fail or alert you when they do.
Panel
- Charles Max Wood
- Dave Kimura
- Luke Stutters
Guest
- Paul Zaich
Links
- Paul's Twitter
- Paul's LinkedIn
Picks
- Blood Pressure Monitor - Dave
- eft - Luke
- Ruby one-liners cookbook - Paul
- Podcast Growth Summit - Chuck
- Most Valuable Dev - Chuck
- Most Valuable Dev Summit - Chuck
- Mushroom Wars - Chuck
- Gmelius - Chuck
Special Guest: Paul Zaich.
Become a supporter of this podcast: https://www.spreaker.com/podcast/ruby-rogues--6102073/support.