The New Stack Podcast

Do All Your AI Workloads Actually Require Expensive GPUs?


Listen Later

GPUs dominate today’s AI landscape, but Google argues they are not necessary for every workload. As AI adoption has grown, customers have increasingly demanded compute options that deliver high performance with lower cost and power consumption. Drawing on its long history of custom silicon, Google introduced Axion CPUs in 2024 to meet needs for massive scale, flexibility, and general-purpose computing alongside AI workloads. The Axion-based C4A instance is generally available, while the newer N4A virtual machines promise up to 2x price performance.

In this episode, Andrei Gueletii, a technical solutions consultant for Google Cloud joined Gari Singh, a product manager for Google Kubernetes Engine (GKE), and Pranay Bakre, a principal solutions engineer at Arm for this episode, recorded at KubeCon + CloudNativeCon North America, in Atlanta. Built on Arm Neoverse V2 cores, Axion processors emphasize energy efficiency and customization, including flexible machine shapes that let users tailor memory and CPU resources. These features are particularly valuable for platform engineering teams, which must optimize centralized infrastructure for cost, FinOps goals, and price performance as they scale.

Importantly, many AI tasks—such as inference for smaller models or batch-oriented jobs—do not require GPUs. CPUs can be more efficient when GPU memory is underutilized or latency demands are low. By decoupling workloads and choosing the right compute for each task, organizations can significantly reduce AI compute costs.

Learn more from The New Stack about the Axion-based C4A: 

Beyond Speed: Why Your Next App Must Be Multi-Architecture

Arm: See a Demo About Migrating a x86-Based App to ARM64

Join our community of newsletter subscribers to stay on top of the news and at the top of your game. 


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

...more
View all episodesView all episodes
Download on the App Store

The New Stack PodcastBy The New Stack

  • 4.3
  • 4.3
  • 4.3
  • 4.3
  • 4.3

4.3

31 ratings


More shows like The New Stack Podcast

View all
The New Stack Analysts by The New Stack

The New Stack Analysts

9 Listeners

The New Stack @ Scale by The New Stack

The New Stack @ Scale

3 Listeners

WSJ What’s News by The Wall Street Journal

WSJ What’s News

4,352 Listeners

Bloomberg Surveillance by Bloomberg

Bloomberg Surveillance

1,173 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

288 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,095 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

625 Listeners

Risky Business by Patrick Gray

Risky Business

372 Listeners

The New Stack Context by The New Stack

The New Stack Context

4 Listeners

Tech Brew Ride Home by Morning Brew

Tech Brew Ride Home

965 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

204 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,019 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

525 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

504 Listeners

Hard Fork by The New York Times

Hard Fork

5,528 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

632 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners