The Cloudcast

Chaos Engineering and Team Health


Listen Later

SHOW: 415

DESCRIPTION: Brian talks with Paul Osman (@paulosman, SRE Engineering Manager @UnderArmour) about aligning business value to Chaos Engineering, measuring its impact, and changing team culture to embrace the chaos.

SHOW SPONSOR LINKS:

  • PricingWire:  Monetization & Pricing Strategy for Software & Technology Innovators
  • PricingWire - Pricing Metric Decision Guide


  • Digital Ocean Homepage
  • Get Started Now and Get a free $50 Credit on Digital Ocean
  • [FREE] Try an IT Pro Challenge
  • Get 20% off VelocityConf passes using discount code CLOUD

CLOUD NEWS OF THE WEEK:

  • Kong announces "Kuma" Service Mesh
  • "Maesh" - by Containous - simpler Service Mesh 

SHOW INTERVIEW LINKS:

  • Paul’s Books (Microservices with JavaScript, Microservices Development)
  • [video] Embracing Chaos - DevOps Day Austin
  • [Velocity] Managing Chaos: Chaos Engineering and Team Health
  • Under Armour Homepage
  • “Chaos Engineering” on previous episodes of The Cloudcast

SHOW NOTES:

Topic 1 - Welcome to the show. Before we get into Chaos Engineering, let’s talk a little bit about your background and some of the things you did prior to joining Under Armour. 

Topic 2 - We’ve talked about Chaos Engineering a few times on the show before. At a company level, what are some of the things (Connected Health) where it makes sense for Under Armour to be investing in Chaos Engineering and developing expertise around this discipline?

Topic 3 - Walk us through how a team at Under Armour thinks about Chaos Engineering, from the business need to think about scheduling it (or not scheduling it), measuring it, and then communicating the results back within your team and to management.

Topic 4 - I think people think that Chaos is a periodic event, like a DR test, but in reality, it needs to be somewhat of an on-going activity. How do you connect the dots between this on-going Chaos and actual problems in your systems - and how/when to measure problems (or what to measure)?

Topic 5 - What is the most difficult part about getting the team culture to understand that Chaos is an important part of day-to-day activities and dealing with “failure” being part of the system?

FEEDBACK?

  • Email: show at thecloudcast dot net
  • Twitter: @thecloudcastnet and @ServerlessCast
...more
View all episodesView all episodes
Download on the App Store

The CloudcastBy Massive Studios

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

147 ratings


More shows like The Cloudcast

View all
Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

Software Engineering Radio - the podcast for professional software developers

272 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

283 Listeners

Thoughtworks Technology Podcast by Thoughtworks

Thoughtworks Technology Podcast

41 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

584 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Soft Skills Engineering by Jamison Dance and Dave Smith

Soft Skills Engineering

284 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

201 Listeners

Gartner ThinkCast by Gartner

Gartner ThinkCast

108 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

140 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

989 Listeners

Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

Kubernetes Podcast from Google

184 Listeners

Practical AI by Practical AI LLC

Practical AI

186 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

63 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

140 Listeners

The Pragmatic Engineer by Gergely Orosz

The Pragmatic Engineer

62 Listeners