
Sign up to save your podcasts
Or


Legendary DevOps friend Matt Stratton joins Ben for a discussion on the very near and dear topic of organizational trauma. Matt shares important lessons from his experiences, as well as a few secrets to building organizational resilience through incident response. (Content warning: General discussion of trauma.)
Matt Stratton is a DevOps Advocate at PagerDuty, where he helps dev and ops teams advance the practice of their craft and become more operationally mature. He collaborates with PagerDuty customers and industry thought leaders in the broader DevOps community, and back in the day, his license plate actually said “DevOps”.
Matt has over 20 years experience in IT operations, ranging from large financial institutions such as JPMorganChase and internet firms, including Apartments.com. He is a sought-after speaker internationally, presenting at Agile, DevOps, and ITSM focused events, including ChefConf, DevOpsDays, Interop, PINK, and others worldwide. Matt is the founder and co-host of the popular Arrested DevOps podcast, as well as a global organizer of the DevOpsDays set of conferences.
He lives in Chicago and has three awesome kids, whom he loves just a little bit more than he loves Doctor Who. He is currently on a mission to discover the best phở in the world.
Hey @nashvilledevops folks! Slides from my talk “Fight, Flight, or Freeze – Releasing Organizational Trauma” are online now at https://t.co/diddN5Yihd #DevOpsDays
— matty stratton (@mattstratton) May 10, 2019I would like to take a minute to thank @jpaulreed for encouraging me to initially submit my talk about organizational trauma. It’s been proving to be my most successful talk ever and I’m so happy every time I get to share it with a new audience. Thanks, Preed!
— matty stratton (@mattstratton) March 1, 2019Intro / Outro Music by DJ Quads
https://soundcloud.com/djquads
Ben: [00:00:00]
We’re not going to discuss our own individual specific traumatic experiences. However, we are going to talk in general about the topic and get into how it relates to organizations as well. So with that intro, I am just absolutely floored to have Matt with us here today.
I first kind of came into contact with Matt around 2014
Matt, I’m so happy to have you here today.
Matt: [00:00:56] Thank you for having me. I’m really excited, to be a part and I, I love being on podcasts. As you said. you know, I, I have my own show and it’s always really fun as a podcaster to be a guest, cause you’re like… You’ve got all this stuff… What your listeners don’t know, there’s all these things going on in your head cause you’re running the show and I’m like, I just get to talk! It’s so refreshing.
Ben: [00:01:16] Thank you for being here. I gave a bit of an intro and I think that relates you to my audience, but I was just kind of wondering, what else do you think they should know before we kind of dive into this really interesting topic of organizational trauma?
Matt: [00:01:28] Sure. So I work for PagerDuty now, as an advocate. And the reason I’m kind of bringing that up is, my focus, especially over the last couple of years, has been a lot around thinking about incident response, learning from incidents.
And I’ve been doing a lot of thought around incidents and
So, kind of having that thought and background, I think is
Ben: [00:02:13]
Matt: [00:02:28]
Ben: [00:02:45] This is a such an interesting topic because I think in general, trauma, as a topic for both individuals and organizations, and frankly from a societal and cultural standpoint as well, it’s a bit stigmatized. And I think a lot of times it’s because it’s tied up with things like shame or, not having the right kind of language to express it in a healthy way, or it just hurts, and so you avoid it. And so when someone comes out with a talk, focused on something very close and near and dear to me like this, I am just absolutely floored that we’re sharing it and talking about this topic.
So what was the background behind the motivation to submit
Matt: [00:03:31] So it’s, it’s, it’s kind of a fun story. As fun as anything connected to trauma can be, I realize. So back in the spring of 2018, I spent a month in a rehab facility, for treatment, for trauma, for my post traumatic stress, among a couple of other things. And when I was there, I went through some treatments involving somatic experiencing and EMDR.
And shortly after I got out, I was visiting my friend Paul, so J Paul Reed in San Francisco, and we were, you know, this was like a week or two after I had gotten out and we were having coffee and just sort of debriefing with him about my experience. We both sort of sat there and were like, this sounds a lot like how companies act when they have outages and everything.
And because this is the other thing, when you, when you do this kind of work, everything becomes a talk, everything in your life, you’re always like, how can I turn this into a talk? I have, I have a, a coworker who joined a monastery, she became a monk, and I was like, Oh, you’re going to come out of that with a great DevOps talk, right?
So, so Paul and I were talking about that, and he said,
Right. And but I wanted to do it because I just wanted to
And, and so I did. And I did. So I gave it for the first
This was really interesting. You know, Paul came to me and said, this needs to be your talk of next year, but it really came from my experiences of like what I was learning about individual treatment for post traumatic stress. And, and seeing that as that metaphor. And I think that’s why it’s been a, topic and a talk that I love to give and it’s been, you know, it’s been great to be able to do it over time and continually refine. Every time I do it, it’s a little better because I, like you talked about like this topic, personal trauma is very stigmatized and I get a lot of folks coming up to me after the talk that are just from a personal level or like, Hey, I have trauma in my background.
This is what resonated. This is what didn’t. And there’s also people who don’t have that background can still get something from the talk. But then every time I have these conversations afterwards, there’s always something that I tweak and I go, Oh, now this is better. So it’s, it’s not like, Oh, it’s getting better just cause I’ve been practicing it more, but had been to be able to develop it just from conversations I’ve had with so many people that you, you don’t really realize how prevalent. This is and people don’t talk about it.
Ben: [00:06:46] that right there I think is
And the way that trauma manifests for individuals and organizations is different, but it has a lot of the same kind of signs and I think sort of comorbidities along with it. It’s interesting that you bring up Paul because Paul, I’m going to get this wrong. So he’s probably gonna like, reach out on Twitter and be like, Hey! But he, he got a master’s right in, in human factors, right? So this, I think is an interesting intersection, right? Where I’ve, as you pointed out on the most recent episode of Arrested DevOps, it’s like anytime someone makes a connection between something, part of the industry that’s been around for awhile, but then applies it to software, it’s like, Oh. I made this, right.
Human factors for software and thinking about outages as traumatic events, and I’ve also been thinking about what layoffs are like for people in organizations and so on. Tell me how that factors into how, what you share about this talk. Like what have you learned about these kinds of events and how organizations can respond to them?
Matt: [00:08:14] So there’s those kind of two, pieces to the, the trauma metaphor, the trauma topic in the talk. so again, an incident can be traumatic to an individual because are participating in it. So I talk in some version of the talk, I talk about self care and talk about things to do.
But the beginning and the first half is a lot more about.
Or rather it’s not that we don’t know how to respond. Our
And what I found was,
So we’re like, “well, it’s happening again” and
The same way that like if something that reminds me of this
Like I’m, I know I’d have to be scared, honestly. I don’t
So we get activated between the sympathetic and the parasympathetic nervous system. We go up and down with activation. That’s just normal. That’s just how we deal, right? Like nobody is even keeled all the time. But, and everybody’s window of tolerance is different. So the window of tolerance is. Beyond which you become activated and you don’t have the healthy response anymore.
Right. And so what you can do is it can either get really
So when we see something that is reminiscent of that, we get into either being hyper aroused or hypo aroused, where we might fear any kind of change, right? We’re where we get into analysis paralysis because. Uh, or, or we become, almost, uh, paranoid, we have over vigilance, right? Where everything is an attack.
Everything is says, and these are, neither of these are
So, so what I found was being able to help people identify
And it’s different for Netflix than it is from Chase than it
Right. Um, cause
Like, well, the good news is there’s some things that we can
Ben: [00:13:04] so there are a couple things in there. One is a, it reminds me of a lot of the work that, you know, I’ve personally done in therapy where a lot of the times it’s about building up the language to describe what is happening so that when it happens, either in that moment. If you have had the chance to do this a couple of times and really reflect on it, or later when you are reflecting back on what happened and trying to understand it, you can have the language to describe, Oh, okay.
So I was stuck on in this moment or I was stuck off. And, it just reminds me a lot just basically having a language for it means that we can together share that experience and reflect on it and actually develop the skills to more quickly bounce back, but also be more resilient to the situation and maybe expand that window over time of what’s, what’s tolerable and, or what’s appropriate.
Matt: [00:13:53]
But the biggest thing when we think about, and again, other disclaimer, I am not a mental health professional. So none of this is mental health advice, but things that I’ve learned, right? So. Like you can’t control how you feel, right? you’re gonna have a feeling and it’s going to happen, and that’s totally okay.
And it’s not bad than it happens. What we have control over
And I think that’s true in the company level too, is to say
I can be known to be a little pedantic about words. Sometimes some words matter, some words don’t. One of the words I’ve been striving to excise from my vocabulary and I, I tried to like get other people along.
This is the word prevention because. you’re setting
Nope. Cause that’s a fool’s errand. You’re not going to do
What you want to do
We’re going to respond, we’re going to feel a certain way.
It’s a bad thing happens, I feel crappy about it, but I have
Don’t tell him I tried to say that, but don’t try to
again, It’s all going back to this idea of, the things are
And an organization, it can happen. We can restore service,
You need to be that. That’s just a thing you do and doesn’t
We’re good at, we’re good at responding to incidents and. The
so I think back, you know, I’ve been been a CIS admin for,
Right? And it’s like back in the day, I was probably, you
It was business as usual, right? So now today you can be,
You’re freak out because it’s like, Whoa, this is something crazy, right? This is something out of the blue and you don’t want to be… Number one, you don’t want to be trying to remember how to do instant response. So that’s why practice makes permanent, but also you don’t want it to be this, as much as we would love incidents to be an exception, and they should be, we don’t want to think about, the, the response process. As this exceptional thing that we only sometimes do. Right.
Ben: [00:18:49] That’s something really important that we can open up a little bit right now I’ve been reading through The Fifth Discipline lately, one of the things that comes to mind is, first of all, we always feel like we have to do something in response to something bad happening. We have to take action because that’s very much culturally the way that we approach things. And I think we could have a really good conversation about what non-intervention is and looks like for these big kind of messy problems.
But the other thing that this book kind of points out a
In, in a situation that is pressured, right? Then you can
And this reminds me of a lot of what I’ve been thinking
We get together and before we think about how to, deal with
And what could it look like to take this to the next level?
Matt: [00:20:28] So there’s a, there’s a couple of things that I found to be really effective. And one is, you talked about sand table exercises. So when you think about game day, and this is all wound up into whether it’s part of your chaos engineering practice or your failure injection or just a straight up game day, this is all going to end up applying.
Right? And whatever implementation of this is. What, it’s
Like, so it’s, it’s, we always talk about, so PagerDuty, we have, we call them failure Fridays, although really failure any days, but we like alliteration. So we still call it failure Friday and it’s a failure injection exercise, right? It’s like, okay, we’ve got a, we’re going to test the service, we’re going to inject failure into it.
You do a chaos experiment and see what happens, and again, our hypothesis was of course, that it will be fine. I always tell people with chaos, or I’m like, you’re testing a hypothesis, your hypothesis that things will go badly. Don’t test that. Then you’re not ready for the experiment. But anyway, we’re assuming that this will work.
But we run it like a real incident. It’s spun up as an incident.
So what we’ve done is we’ve taken this, we made this pretty
We know how we can bail out of it at any time if things go
Right. It’s just like, it’s a calm thing. And I’ve seen folks
And it’s not an exercise to test, to stress test your
When it does happen at two in the morning, do I mean it’s not going to be stressful? Of course not. It’s going to be stressful because the business is on the line and it’s two in the morning and we’re just upset because we got woke up. But we’re able to be, we, I always like to say the beautiful thing about the human brain is we’re very smart and our brain is also very stupid, right? You can trick your brain, right? So we’re creating a physiological response. And I always tell people, if you don’t believe it, let me tell you some. I’ve worn contacts for the majority of my life, you know, since I was 16 years old, I’ve worn contacts. When I put my contacts in, in the morning, I wake up, I get more alert when I take them out at the end of the evening, I get a little sleepy, and that’s from, you know, 35 years or whatever, of putting contacts in first thing in the morning and taking them out at night. So what we’re doing is we want to create that physiological response, and that’s what we do in post traumatic stress treatment of somatic experiencing is we start with the body, right?
Where we’re getting the physiological way we want to think. And then we associate that. Positive physiology. With the traumatic experience. And so we’re doing the same thing. We’re, we’re leading with the body. And that’s the idea. Beyond doing, like you said, doing these exercises, it’s really not to actually practice and get better at knowing how to log into PagerDuty or how to like, write a document or whatever.
I mean, that’s good.
There’s a similar thing to do that could be helpful. When you have incidents, you run them at their initial severity no matter what. And you run them to their ending. So false alarm incident, finish it out like an incident. I mean, it doesn’t mean like everybody panic and whatever, but this will happen, right?
Like you’ll, you’ll kick off your instant response process
It’s just a thing. Oh, I bop into the Slack channel for an
Like this weird like edge case, right? This is part of life,
Ben: [00:25:18] I really want to call out one thing in particular, which is you differentiated what what is troubleshooting work from what is running the incident. And this building these kinds of capabilities by creating a safe place to exercise these capabilities and build that muscle is not, something that someone goes off in a room and designs a weird puzzle for. This is not about trying to make it hard or trying to stress people out or trying to, kind of, make it a different kind of problem than this. It is just making it safe to do incidents.
And I think that is so enormously amazing.
What is something that you would recommend in terms of
What kind of recommendations can be made.
Matt: [00:26:04] So a couple of things. And we’ll put some links to put in the show notes. So Dr. Peter Levine is a trauma doctor who’s done a lot of research around and kind of founded the somatic experiencing movement. So I gave a link around what Dr Levine has to say about somatic experiencing.
I have another… gave another link that, that delves into the window of tolerance. And another one about how fight or flight really works. So we didn’t really get into that. But that’s, that’s a big part of where this comes in. I also did an episode of Greater than Code about about a year ago, talking about healing organizational trauma, and we’ve talked about a few different things.
Talk a little bit more about… You mentioned at the
So this is why. I always say, write-only postmortems don’t help anybody. Right? You have to they have to be shared and they need to be told. So we talked a lot more about that on that show. I really like that. You know, I rant about this on the Twitters all the time, you know, feel free to find me at Twitter, @MattStratton.
And, we can follow up on this and talk some more on, and also, if you want to see. Where I might be coming to a conference near you. If you go to speaking.mattstratton.com that’s my speaking page. It’s got all my past talks and any upcoming things, so if you all listen to show and I see you at a conference somewhere, come say hi. I’d love to talk about this.
Ben: [00:27:36]
Matt: [00:27:45]
By Ben MosiorLegendary DevOps friend Matt Stratton joins Ben for a discussion on the very near and dear topic of organizational trauma. Matt shares important lessons from his experiences, as well as a few secrets to building organizational resilience through incident response. (Content warning: General discussion of trauma.)
Matt Stratton is a DevOps Advocate at PagerDuty, where he helps dev and ops teams advance the practice of their craft and become more operationally mature. He collaborates with PagerDuty customers and industry thought leaders in the broader DevOps community, and back in the day, his license plate actually said “DevOps”.
Matt has over 20 years experience in IT operations, ranging from large financial institutions such as JPMorganChase and internet firms, including Apartments.com. He is a sought-after speaker internationally, presenting at Agile, DevOps, and ITSM focused events, including ChefConf, DevOpsDays, Interop, PINK, and others worldwide. Matt is the founder and co-host of the popular Arrested DevOps podcast, as well as a global organizer of the DevOpsDays set of conferences.
He lives in Chicago and has three awesome kids, whom he loves just a little bit more than he loves Doctor Who. He is currently on a mission to discover the best phở in the world.
Hey @nashvilledevops folks! Slides from my talk “Fight, Flight, or Freeze – Releasing Organizational Trauma” are online now at https://t.co/diddN5Yihd #DevOpsDays
— matty stratton (@mattstratton) May 10, 2019I would like to take a minute to thank @jpaulreed for encouraging me to initially submit my talk about organizational trauma. It’s been proving to be my most successful talk ever and I’m so happy every time I get to share it with a new audience. Thanks, Preed!
— matty stratton (@mattstratton) March 1, 2019Intro / Outro Music by DJ Quads
https://soundcloud.com/djquads
Ben: [00:00:00]
We’re not going to discuss our own individual specific traumatic experiences. However, we are going to talk in general about the topic and get into how it relates to organizations as well. So with that intro, I am just absolutely floored to have Matt with us here today.
I first kind of came into contact with Matt around 2014
Matt, I’m so happy to have you here today.
Matt: [00:00:56] Thank you for having me. I’m really excited, to be a part and I, I love being on podcasts. As you said. you know, I, I have my own show and it’s always really fun as a podcaster to be a guest, cause you’re like… You’ve got all this stuff… What your listeners don’t know, there’s all these things going on in your head cause you’re running the show and I’m like, I just get to talk! It’s so refreshing.
Ben: [00:01:16] Thank you for being here. I gave a bit of an intro and I think that relates you to my audience, but I was just kind of wondering, what else do you think they should know before we kind of dive into this really interesting topic of organizational trauma?
Matt: [00:01:28] Sure. So I work for PagerDuty now, as an advocate. And the reason I’m kind of bringing that up is, my focus, especially over the last couple of years, has been a lot around thinking about incident response, learning from incidents.
And I’ve been doing a lot of thought around incidents and
So, kind of having that thought and background, I think is
Ben: [00:02:13]
Matt: [00:02:28]
Ben: [00:02:45] This is a such an interesting topic because I think in general, trauma, as a topic for both individuals and organizations, and frankly from a societal and cultural standpoint as well, it’s a bit stigmatized. And I think a lot of times it’s because it’s tied up with things like shame or, not having the right kind of language to express it in a healthy way, or it just hurts, and so you avoid it. And so when someone comes out with a talk, focused on something very close and near and dear to me like this, I am just absolutely floored that we’re sharing it and talking about this topic.
So what was the background behind the motivation to submit
Matt: [00:03:31] So it’s, it’s, it’s kind of a fun story. As fun as anything connected to trauma can be, I realize. So back in the spring of 2018, I spent a month in a rehab facility, for treatment, for trauma, for my post traumatic stress, among a couple of other things. And when I was there, I went through some treatments involving somatic experiencing and EMDR.
And shortly after I got out, I was visiting my friend Paul, so J Paul Reed in San Francisco, and we were, you know, this was like a week or two after I had gotten out and we were having coffee and just sort of debriefing with him about my experience. We both sort of sat there and were like, this sounds a lot like how companies act when they have outages and everything.
And because this is the other thing, when you, when you do this kind of work, everything becomes a talk, everything in your life, you’re always like, how can I turn this into a talk? I have, I have a, a coworker who joined a monastery, she became a monk, and I was like, Oh, you’re going to come out of that with a great DevOps talk, right?
So, so Paul and I were talking about that, and he said,
Right. And but I wanted to do it because I just wanted to
And, and so I did. And I did. So I gave it for the first
This was really interesting. You know, Paul came to me and said, this needs to be your talk of next year, but it really came from my experiences of like what I was learning about individual treatment for post traumatic stress. And, and seeing that as that metaphor. And I think that’s why it’s been a, topic and a talk that I love to give and it’s been, you know, it’s been great to be able to do it over time and continually refine. Every time I do it, it’s a little better because I, like you talked about like this topic, personal trauma is very stigmatized and I get a lot of folks coming up to me after the talk that are just from a personal level or like, Hey, I have trauma in my background.
This is what resonated. This is what didn’t. And there’s also people who don’t have that background can still get something from the talk. But then every time I have these conversations afterwards, there’s always something that I tweak and I go, Oh, now this is better. So it’s, it’s not like, Oh, it’s getting better just cause I’ve been practicing it more, but had been to be able to develop it just from conversations I’ve had with so many people that you, you don’t really realize how prevalent. This is and people don’t talk about it.
Ben: [00:06:46] that right there I think is
And the way that trauma manifests for individuals and organizations is different, but it has a lot of the same kind of signs and I think sort of comorbidities along with it. It’s interesting that you bring up Paul because Paul, I’m going to get this wrong. So he’s probably gonna like, reach out on Twitter and be like, Hey! But he, he got a master’s right in, in human factors, right? So this, I think is an interesting intersection, right? Where I’ve, as you pointed out on the most recent episode of Arrested DevOps, it’s like anytime someone makes a connection between something, part of the industry that’s been around for awhile, but then applies it to software, it’s like, Oh. I made this, right.
Human factors for software and thinking about outages as traumatic events, and I’ve also been thinking about what layoffs are like for people in organizations and so on. Tell me how that factors into how, what you share about this talk. Like what have you learned about these kinds of events and how organizations can respond to them?
Matt: [00:08:14] So there’s those kind of two, pieces to the, the trauma metaphor, the trauma topic in the talk. so again, an incident can be traumatic to an individual because are participating in it. So I talk in some version of the talk, I talk about self care and talk about things to do.
But the beginning and the first half is a lot more about.
Or rather it’s not that we don’t know how to respond. Our
And what I found was,
So we’re like, “well, it’s happening again” and
The same way that like if something that reminds me of this
Like I’m, I know I’d have to be scared, honestly. I don’t
So we get activated between the sympathetic and the parasympathetic nervous system. We go up and down with activation. That’s just normal. That’s just how we deal, right? Like nobody is even keeled all the time. But, and everybody’s window of tolerance is different. So the window of tolerance is. Beyond which you become activated and you don’t have the healthy response anymore.
Right. And so what you can do is it can either get really
So when we see something that is reminiscent of that, we get into either being hyper aroused or hypo aroused, where we might fear any kind of change, right? We’re where we get into analysis paralysis because. Uh, or, or we become, almost, uh, paranoid, we have over vigilance, right? Where everything is an attack.
Everything is says, and these are, neither of these are
So, so what I found was being able to help people identify
And it’s different for Netflix than it is from Chase than it
Right. Um, cause
Like, well, the good news is there’s some things that we can
Ben: [00:13:04] so there are a couple things in there. One is a, it reminds me of a lot of the work that, you know, I’ve personally done in therapy where a lot of the times it’s about building up the language to describe what is happening so that when it happens, either in that moment. If you have had the chance to do this a couple of times and really reflect on it, or later when you are reflecting back on what happened and trying to understand it, you can have the language to describe, Oh, okay.
So I was stuck on in this moment or I was stuck off. And, it just reminds me a lot just basically having a language for it means that we can together share that experience and reflect on it and actually develop the skills to more quickly bounce back, but also be more resilient to the situation and maybe expand that window over time of what’s, what’s tolerable and, or what’s appropriate.
Matt: [00:13:53]
But the biggest thing when we think about, and again, other disclaimer, I am not a mental health professional. So none of this is mental health advice, but things that I’ve learned, right? So. Like you can’t control how you feel, right? you’re gonna have a feeling and it’s going to happen, and that’s totally okay.
And it’s not bad than it happens. What we have control over
And I think that’s true in the company level too, is to say
I can be known to be a little pedantic about words. Sometimes some words matter, some words don’t. One of the words I’ve been striving to excise from my vocabulary and I, I tried to like get other people along.
This is the word prevention because. you’re setting
Nope. Cause that’s a fool’s errand. You’re not going to do
What you want to do
We’re going to respond, we’re going to feel a certain way.
It’s a bad thing happens, I feel crappy about it, but I have
Don’t tell him I tried to say that, but don’t try to
again, It’s all going back to this idea of, the things are
And an organization, it can happen. We can restore service,
You need to be that. That’s just a thing you do and doesn’t
We’re good at, we’re good at responding to incidents and. The
so I think back, you know, I’ve been been a CIS admin for,
Right? And it’s like back in the day, I was probably, you
It was business as usual, right? So now today you can be,
You’re freak out because it’s like, Whoa, this is something crazy, right? This is something out of the blue and you don’t want to be… Number one, you don’t want to be trying to remember how to do instant response. So that’s why practice makes permanent, but also you don’t want it to be this, as much as we would love incidents to be an exception, and they should be, we don’t want to think about, the, the response process. As this exceptional thing that we only sometimes do. Right.
Ben: [00:18:49] That’s something really important that we can open up a little bit right now I’ve been reading through The Fifth Discipline lately, one of the things that comes to mind is, first of all, we always feel like we have to do something in response to something bad happening. We have to take action because that’s very much culturally the way that we approach things. And I think we could have a really good conversation about what non-intervention is and looks like for these big kind of messy problems.
But the other thing that this book kind of points out a
In, in a situation that is pressured, right? Then you can
And this reminds me of a lot of what I’ve been thinking
We get together and before we think about how to, deal with
And what could it look like to take this to the next level?
Matt: [00:20:28] So there’s a, there’s a couple of things that I found to be really effective. And one is, you talked about sand table exercises. So when you think about game day, and this is all wound up into whether it’s part of your chaos engineering practice or your failure injection or just a straight up game day, this is all going to end up applying.
Right? And whatever implementation of this is. What, it’s
Like, so it’s, it’s, we always talk about, so PagerDuty, we have, we call them failure Fridays, although really failure any days, but we like alliteration. So we still call it failure Friday and it’s a failure injection exercise, right? It’s like, okay, we’ve got a, we’re going to test the service, we’re going to inject failure into it.
You do a chaos experiment and see what happens, and again, our hypothesis was of course, that it will be fine. I always tell people with chaos, or I’m like, you’re testing a hypothesis, your hypothesis that things will go badly. Don’t test that. Then you’re not ready for the experiment. But anyway, we’re assuming that this will work.
But we run it like a real incident. It’s spun up as an incident.
So what we’ve done is we’ve taken this, we made this pretty
We know how we can bail out of it at any time if things go
Right. It’s just like, it’s a calm thing. And I’ve seen folks
And it’s not an exercise to test, to stress test your
When it does happen at two in the morning, do I mean it’s not going to be stressful? Of course not. It’s going to be stressful because the business is on the line and it’s two in the morning and we’re just upset because we got woke up. But we’re able to be, we, I always like to say the beautiful thing about the human brain is we’re very smart and our brain is also very stupid, right? You can trick your brain, right? So we’re creating a physiological response. And I always tell people, if you don’t believe it, let me tell you some. I’ve worn contacts for the majority of my life, you know, since I was 16 years old, I’ve worn contacts. When I put my contacts in, in the morning, I wake up, I get more alert when I take them out at the end of the evening, I get a little sleepy, and that’s from, you know, 35 years or whatever, of putting contacts in first thing in the morning and taking them out at night. So what we’re doing is we want to create that physiological response, and that’s what we do in post traumatic stress treatment of somatic experiencing is we start with the body, right?
Where we’re getting the physiological way we want to think. And then we associate that. Positive physiology. With the traumatic experience. And so we’re doing the same thing. We’re, we’re leading with the body. And that’s the idea. Beyond doing, like you said, doing these exercises, it’s really not to actually practice and get better at knowing how to log into PagerDuty or how to like, write a document or whatever.
I mean, that’s good.
There’s a similar thing to do that could be helpful. When you have incidents, you run them at their initial severity no matter what. And you run them to their ending. So false alarm incident, finish it out like an incident. I mean, it doesn’t mean like everybody panic and whatever, but this will happen, right?
Like you’ll, you’ll kick off your instant response process
It’s just a thing. Oh, I bop into the Slack channel for an
Like this weird like edge case, right? This is part of life,
Ben: [00:25:18] I really want to call out one thing in particular, which is you differentiated what what is troubleshooting work from what is running the incident. And this building these kinds of capabilities by creating a safe place to exercise these capabilities and build that muscle is not, something that someone goes off in a room and designs a weird puzzle for. This is not about trying to make it hard or trying to stress people out or trying to, kind of, make it a different kind of problem than this. It is just making it safe to do incidents.
And I think that is so enormously amazing.
What is something that you would recommend in terms of
What kind of recommendations can be made.
Matt: [00:26:04] So a couple of things. And we’ll put some links to put in the show notes. So Dr. Peter Levine is a trauma doctor who’s done a lot of research around and kind of founded the somatic experiencing movement. So I gave a link around what Dr Levine has to say about somatic experiencing.
I have another… gave another link that, that delves into the window of tolerance. And another one about how fight or flight really works. So we didn’t really get into that. But that’s, that’s a big part of where this comes in. I also did an episode of Greater than Code about about a year ago, talking about healing organizational trauma, and we’ve talked about a few different things.
Talk a little bit more about… You mentioned at the
So this is why. I always say, write-only postmortems don’t help anybody. Right? You have to they have to be shared and they need to be told. So we talked a lot more about that on that show. I really like that. You know, I rant about this on the Twitters all the time, you know, feel free to find me at Twitter, @MattStratton.
And, we can follow up on this and talk some more on, and also, if you want to see. Where I might be coming to a conference near you. If you go to speaking.mattstratton.com that’s my speaking page. It’s got all my past talks and any upcoming things, so if you all listen to show and I see you at a conference somewhere, come say hi. I’d love to talk about this.
Ben: [00:27:36]
Matt: [00:27:45]