
Sign up to save your podcasts
Or
Send us a Text Message.
In this episode of DevOps Sauna Season 4, the hosts dive into the recent CrowdStrike incident, which caused widespread bluescreen errors and significant disruptions globally. They explore why a seemingly routine update was deployed simultaneously to all CrowdStrike users, resulting in massive system crashes.
Joined by security expert and previous host Andy Allred, the discussion covers the role of CrowdStrike as an endpoint detection and response (EDR) system, its necessity for running with high privileges in kernel space, and the challenges of maintaining such critical security software.
The conversation highlights the need for rigorous testing, canary releases, and robust observability to prevent similar incidents. The hosts also discuss the implications of regulatory requirements, the importance of continuous delivery models in DevOps, and the lessons learned from the CrowdStrike mishap.
Despite the complexity and scale of the recovery process, the consensus is clear: Continuous improvement in testing and deployment practices is crucial for the stability and security of modern IT environments.
Create value in every commit with continuous delivery: https://www.eficode.com/services/continuous-delivery
Learn how to secure your DevOps practices, how to meet the needs of different stakeholders, and about combining Agility, structure, and high security in software development: https://www.eficode.com/blog/events/devsecops-webinar-secure-continuous-development-in-it-environments
5
22 ratings
Send us a Text Message.
In this episode of DevOps Sauna Season 4, the hosts dive into the recent CrowdStrike incident, which caused widespread bluescreen errors and significant disruptions globally. They explore why a seemingly routine update was deployed simultaneously to all CrowdStrike users, resulting in massive system crashes.
Joined by security expert and previous host Andy Allred, the discussion covers the role of CrowdStrike as an endpoint detection and response (EDR) system, its necessity for running with high privileges in kernel space, and the challenges of maintaining such critical security software.
The conversation highlights the need for rigorous testing, canary releases, and robust observability to prevent similar incidents. The hosts also discuss the implications of regulatory requirements, the importance of continuous delivery models in DevOps, and the lessons learned from the CrowdStrike mishap.
Despite the complexity and scale of the recovery process, the consensus is clear: Continuous improvement in testing and deployment practices is crucial for the stability and security of modern IT environments.
Create value in every commit with continuous delivery: https://www.eficode.com/services/continuous-delivery
Learn how to secure your DevOps practices, how to meet the needs of different stakeholders, and about combining Agility, structure, and high security in software development: https://www.eficode.com/blog/events/devsecops-webinar-secure-continuous-development-in-it-environments
285 Listeners
153 Listeners
5,238 Listeners
29 Listeners
36 Listeners
16 Listeners