KubeFM

How Policies Saved us a Thousand Headaches, with Alessandro Pomponio



Alessandro Pomponio from IBM Research explains how his team transformed their chaotic bare-metal clusters into a well-governed, self-service platform for AI and scientific workloads. He walks through their journey from manual cluster interventions to a fully automated GitOps-first architecture using ArgoCD, Kyverno, and Kueue to handle everything from policy enforcement to GPU scheduling.

You will learn:

  • How to implement GitOps workflows that reduce administrative burden while maintaining governance and visibility across multi-tenant research environments

  • Practical policy enforcement strategies using Kyverno to prevent GPU monopolization, block interactive pod usage, and automatically inject scheduling constraints

  • Fair resource sharing techniques with Kueue to manage scarce GPU resources across different hardware types while supporting both specific and flexible allocation requests

  • Organizational change management approaches for gaining stakeholder buy-in, upskilling admin teams, and communicating policy changes to research users

Sponsor

This episode is brought to you by Testkube—the ultimate Continuous Testing Platform for Cloud Native applications. Scale fast, test continuously, and ship confidently. Check it out at testkube.io

More info

  • Find all the links and info for this episode here: https://ku.bz/5sK7BFZ-8




