The New Stack Podcast

Why the CNCF's New Executive Director is Obsessed With Inference


Listen Later

Jonathan Bryce, the new CNCF executive director, argues that inference—not model training—will define the next decade of computing. Speaking at KubeCon North America 2025, he emphasized that while the industry obsesses over massive LLM training runs, the real opportunity lies in efficiently serving these models at scale. Cloud-native infrastructure, he says, is uniquely suited to this shift because inference requires real-time deployment, security, scaling, and observability—strengths of the CNCF ecosystem. 

Bryce believes Kubernetes is already central to modern inference stacks, with projects like Ray, KServe, and emerging GPU-oriented tooling enabling teams to deploy and operationalize models. To bring consistency to this fast-moving space, the CNCF launched a Kubernetes AI Conformance Program, ensuring environments support GPU workloads and Dynamic Resource Allocation. With AI agents poised to multiply inference demand by executing parallel, multi-step tasks, efficiency becomes essential. Bryce predicts that smaller, task-specific models and cloud-native routing optimizations will drive major performance gains. Ultimately, he sees CNCF technologies forming the foundation for what he calls “the biggest workload mankind will ever have.” 

Learn more from The New Stack about inference: 

Confronting AI’s Next Big Challenge: Inference Compute 

Deep Infra Is Building an AI Inference Cloud for Developers 

Join our community of newsletter subscribers to stay on top of the news and at the top of your game. 


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

...more
View all episodesView all episodes
Download on the App Store

The New Stack PodcastBy The New Stack

  • 4.3
  • 4.3
  • 4.3
  • 4.3
  • 4.3

4.3

31 ratings


More shows like The New Stack Podcast

View all
The New Stack Analysts by The New Stack

The New Stack Analysts

9 Listeners

The New Stack @ Scale by The New Stack

The New Stack @ Scale

3 Listeners

WSJ What’s News by The Wall Street Journal

WSJ What’s News

4,358 Listeners

Bloomberg Surveillance by Bloomberg

Bloomberg Surveillance

1,177 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

288 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,097 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Risky Business by Patrick Gray

Risky Business

372 Listeners

The New Stack Context by The New Stack

The New Stack Context

4 Listeners

Tech Brew Ride Home by Morning Brew

Tech Brew Ride Home

968 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

207 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,064 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

531 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

504 Listeners

Hard Fork by The New York Times

Hard Fork

5,532 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

227 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

638 Listeners

AI + a16z by a16z

AI + a16z

35 Listeners