August 20, 2025

LLM-D, with Clayton Coleman and Rob Shaw

Listen Later

52 minutes

Guests are Clayton Coleman and Rob Shaw. Clayton is a Core contributor to Kubernetes, the containerized cluster manager, and founding architect for OpenShift, the open source platform as a service. Clayton helped launch the shift to cloud native applications and the platforms that enable them. At Google my mission is to make Kubernetes and GKE the best place to run workloads, especially accelerated AI/ML workloads, and especially especially very large model inference at scale with the inference gateway and llm-d. Rob Shaw is an Engineering Director at Redhat and is a contributor to the vLLM project.

Do you have something cool to share? Some questions? Let us know:

- web: kubernetespodcast.com

- mail: [email protected]

- twitter: @kubernetespod

- bluesky: @kubernetespodcast.com

News of the week

Kubernetes 1.34 is expected to release end of August
Kubecrash.io: A platform Eng conference with a purpose
CNCF top 30 project of 2025

Links from the interview

LLM-D
KubeCon EU 25 Keynote: LLM-Aware Load Balancing in Kubernetes
WG Serving
vLLM
Disaggregated Prefilling
LWS: LeaderWorkerSet

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Kubernetes Podcast from Google

By Abdel Sghiouar, Kaslin Fields

4.8

179179 ratings

August 20, 2025

LLM-D, with Clayton Coleman and Rob Shaw

Listen Later

52 minutes

Guests are Clayton Coleman and Rob Shaw. Clayton is a Core contributor to Kubernetes, the containerized cluster manager, and founding architect for OpenShift, the open source platform as a service. Clayton helped launch the shift to cloud native applications and the platforms that enable them. At Google my mission is to make Kubernetes and GKE the best place to run workloads, especially accelerated AI/ML workloads, and especially especially very large model inference at scale with the inference gateway and llm-d. Rob Shaw is an Engineering Director at Redhat and is a contributor to the vLLM project.

Do you have something cool to share? Some questions? Let us know:

- web: kubernetespodcast.com

- mail: [email protected]

- twitter: @kubernetespod

- bluesky: @kubernetespodcast.com

News of the week

Kubernetes 1.34 is expected to release end of August
Kubecrash.io: A platform Eng conference with a purpose
CNCF top 30 project of 2025

Links from the interview

LLM-D
KubeCon EU 25 Keynote: LLM-Aware Load Balancing in Kubernetes
WG Serving
vLLM
Disaggregated Prefilling
LWS: LeaderWorkerSet

...more

More shows like Kubernetes Podcast from Google

Software Engineering Radio - the podcast for professional software developers by team@se-radio.net (SE-Radio Team)

Software Engineering Radio - the podcast for professional software developers

275 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

289 Listeners

Security Now (Audio) by TWiT

Security Now (Audio)

2,009 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

626 Listeners

Risky Business by Risky Business Media

Risky Business

375 Listeners

The Enterprise AI Show by Massive Studios

The Enterprise AI Show

149 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

HBR IdeaCast by Harvard Business Review

HBR IdeaCast

145 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

345 Listeners

Inside the Strategy Room by McKinsey & Company

Inside the Strategy Room

170 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

203 Listeners

Last Week in AI by Skynet Today

Last Week in AI

314 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

508 Listeners

KubeFM by KubeFM

KubeFM

2 Listeners

The Pragmatic Engineer by Gergely Orosz

The Pragmatic Engineer

74 Listeners