Cloud Computing with Fexingo: AWS, Azure, GCP, and Modern Infrastructure Conversations

How Cloud Resiliency Engineering Became the New Reliability Standard



Episode 30 of Cloud Computing with Fexingo dives into resiliency engineering, the practice of deliberately breaking systems in production to build stronger infrastructure. Lucas and Luna explore how Netflix's Chaos Monkey evolved into a full discipline adopted by AWS, Azure, and Google Cloud. They dissect the real story behind a 2023 incident in which a major European bank lost $150 million in three hours to a cascading DNS failure, and explain why chaos engineering is no longer optional for enterprises running mission-critical workloads. The hosts also discuss how cloud providers now embed fault injection tools directly into their platforms, and why the financial services sector is leading adoption. The episode offers concrete takeaways for architects and engineering leaders who want to reduce mean time to recovery without inflating their cloud bill.

#CloudComputing #ResiliencyEngineering #ChaosEngineering #NetflixChaosMonkey #AWS #Azure #GoogleCloud #SiteReliabilityEngineering #FaultInjection #DNSFailure #FinancialServices #MeanTimeToRecovery #Architecture #Technology #FexingoBusiness #BusinessPodcast #CloudInfrastructure #Podcast

Keep every episode free: buymeacoffee.com/fexingo


By Fexingo