
Sign up to save your podcasts
Or


Welcome to AI Unraveled: Your daily strategic briefing on the business impact of AI.
Today's Highlights: We are switching to "Special Episode" status for a critical infrastructure deep dive. We tackle the GPU Scheduling Nightmare—why your expensive H100s are sitting idle, why default Kubernetes fails at AI orchestration, and the new playbook enterprises are using to reclaim millions in wasted compute.
Strategic Pillars & Topics
📉 The Core Problem: The "Idle Iron" Crisis
🛠 The Solutions: Advanced Orchestration
🛡 Security & Multi-Tenancy
Host Connection & Engagement
Timestamps
00:00 Welcome & The "Idle Iron" Crisis 🎙️
01:50 The Default Kubernetes Failure Mode (FIFO & fragmentation)
03:20 Why AI Workloads are Different (Training vs. Inference)
05:50 Strategy 1: Gang Scheduling Explained
07:40 Strategy 2: Bin Packing for Density
08:30 Strategy 3: Preemption & The "Resume" Problem
09:50 Strategy 4: Multi-Instance GPUs (MIG) & Slicing
11:20 Governance: Quotas & Fair Share Scheduling
12:50 Security: Multi-tenancy & Isolation
14:10 Tooling Landscape: Volcano, YuniKorn, & Run:AI 🧰
15:45 Final Thesis: Utilization = Revenue 💰
🚀 STOP MARKETING TO THE MASSES. START BRIEFING THE C-SUITE.
Leverage our zero-noise intelligence to own the conversation in your industry. Secure Your Strategic Podcast Consultation Now: https://forms.gle/YHQPzQcZecFbmNds5
Keywords: Kubernetes AI, GPU Scheduling, Nvidia H100, Gang Scheduling, Bin Packing, Multi-Instance GPU, MIG, AI Infrastructure, MLOps, Run:AI, Volcano Scheduler, YuniKorn, Etienne Noumen.
#AI #AIUnraveled
By Etienne Noumen4.7
1313 ratings
Welcome to AI Unraveled: Your daily strategic briefing on the business impact of AI.
Today's Highlights: We are switching to "Special Episode" status for a critical infrastructure deep dive. We tackle the GPU Scheduling Nightmare—why your expensive H100s are sitting idle, why default Kubernetes fails at AI orchestration, and the new playbook enterprises are using to reclaim millions in wasted compute.
Strategic Pillars & Topics
📉 The Core Problem: The "Idle Iron" Crisis
🛠 The Solutions: Advanced Orchestration
🛡 Security & Multi-Tenancy
Host Connection & Engagement
Timestamps
00:00 Welcome & The "Idle Iron" Crisis 🎙️
01:50 The Default Kubernetes Failure Mode (FIFO & fragmentation)
03:20 Why AI Workloads are Different (Training vs. Inference)
05:50 Strategy 1: Gang Scheduling Explained
07:40 Strategy 2: Bin Packing for Density
08:30 Strategy 3: Preemption & The "Resume" Problem
09:50 Strategy 4: Multi-Instance GPUs (MIG) & Slicing
11:20 Governance: Quotas & Fair Share Scheduling
12:50 Security: Multi-tenancy & Isolation
14:10 Tooling Landscape: Volcano, YuniKorn, & Run:AI 🧰
15:45 Final Thesis: Utilization = Revenue 💰
🚀 STOP MARKETING TO THE MASSES. START BRIEFING THE C-SUITE.
Leverage our zero-noise intelligence to own the conversation in your industry. Secure Your Strategic Podcast Consultation Now: https://forms.gle/YHQPzQcZecFbmNds5
Keywords: Kubernetes AI, GPU Scheduling, Nvidia H100, Gang Scheduling, Bin Packing, Multi-Instance GPU, MIG, AI Infrastructure, MLOps, Run:AI, Volcano Scheduler, YuniKorn, Etienne Noumen.
#AI #AIUnraveled

1,640 Listeners

334 Listeners

226 Listeners

207 Listeners

494 Listeners

197 Listeners

157 Listeners

154 Listeners

227 Listeners

608 Listeners

107 Listeners

54 Listeners

173 Listeners

55 Listeners

96 Listeners