


Episode 30 of Cloud Computing with Fexingo dives into resiliency engineering: the practice of deliberately breaking systems in production to build stronger infrastructure. Lucas and Luna explore how Netflix's Chaos Monkey evolved into a full-blown discipline adopted by AWS, Azure, and Google Cloud. They dissect the real story behind a 2023 incident at a major European bank that lost $150 million in three hours due to a cascading DNS failure, and explain why chaos engineering is no longer optional for enterprises running mission-critical workloads. The hosts also discuss how cloud providers now embed fault injection tools directly into their platforms, and why the financial services sector is leading adoption. This episode offers concrete takeaways for architects and engineering leaders looking to reduce mean time to recovery without inflating their cloud bill.
#CloudComputing #ResiliencyEngineering #ChaosEngineering #NetflixChaosMonkey #AWS #Azure #GoogleCloud #SiteReliabilityEngineering #FaultInjection #DNSFailure #FinancialServices #MeanTimeToRecovery #Architecture #Technology #FexingoBusiness #BusinessPodcast #CloudInfrastructure #Podcast
Keep every episode free: buymeacoffee.com/fexingo
By Fexingo