KubeFM

From Fragile to Faultless: Kubernetes Self-Healing In Practice, with Grzegorz Głąb



Discover how to build resilient Kubernetes environments at scale with practical automation strategies from an engineer who's tackled complex production challenges.

Grzegorz Głąb, Kubernetes Engineer at Cloud Kitchens, shares his team's journey developing a comprehensive self-healing framework. He explains how they addressed issues ranging from spot node preemptions to network packet drops caused by unbalanced IRQs, providing concrete examples of automation that prevents downtime and improves reliability.

You will learn:

  • How managed Kubernetes services like AKS provide benefits but require customization for specific use cases

  • The architecture of an effective self-healing framework using DaemonSets and Deployments with Kubernetes-native components

  • Practical solutions for common challenges like StatefulSet pods stuck on unreachable nodes and cleaning up orphaned pods

  • Techniques for workload-level automation, including throttling CPU-hungry pods and automating diagnostic data collection

Sponsor

This episode is sponsored by LearnKube — get started on your Kubernetes journey through comprehensive online, in-person or remote training.

More info

  • Find all the links and info for this episode here: https://ku.bz/yg_fkP0LN

  • Interested in sponsoring an episode? Learn more.


