Humans of Reliability

Scientific Incident Management with Dan Slimmon


Listen Later

Dan Slimmon is an incident management veteran who's worked at Etsy, HashiCorp, and now leads consulting and training on pragmatic, non-bureaucratic incident response. 

In this episode, Dan shares his philosophy on "scientific incident response," the importance of hypothesis-driven troubleshooting, and why incidents should be seen as normal in complex systems. 

We also explore:

  • Why asking the right questions is more important than knowing all the answers. 
  • How to use nerd sniping to unlock insights from engineers. 
  • Common failure patterns he sees across organizations. 

EPISODE LINKS: 

  • Video and key takeaways 
  • D2E Incident Leadership Course 
...more
View all episodesView all episodes
Download on the App Store

Humans of ReliabilityBy Rootly