Google SRE Prodcast

The One With Damion Yates and Building AI systems


Listen Later

How do you introduce Site Reliability Engineering to an AI research lab, bringing concepts of scale to engineers who are at the leading edge of AI systems?

In the latest episode of The Prodcast, hosts Steve McGhee and Florian Rathgeber chat with Damion Yates, who helped establish the reliability engineering culture at Google DeepMind. Damion shares his journey of bringing scalable infrastructure to DeepMind, supporting massive machine learning experiments.

Discover the unique challenges of supporting AI research, such as managing highly expensive "lockstep" training models where a single machine failure halts the entire process. Damion also explains why he believes "luck is our enemy" in systems engineering, and why protecting a research scientist's time is the ultimate metric for success.

...more
View all episodesView all episodes
Download on the App Store

Google SRE ProdcastBy Salim Virji

  • 5
  • 5
  • 5
  • 5
  • 5

5

18 ratings


More shows like Google SRE Prodcast

View all
Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

32,246 Listeners

Planet Money by NPR

Planet Money

30,609 Listeners

Hidden Brain by Hidden Brain, Shankar Vedantam

Hidden Brain

43,687 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

288 Listeners

The Reasoning Show by Massive Studios

The Reasoning Show

154 Listeners

All In The Mind by ABC Australia

All In The Mind

759 Listeners

Warriors Plus Minus: A show about the Golden State Warriors by Audacy

Warriors Plus Minus: A show about the Golden State Warriors

681 Listeners

Python Bytes by Michael Kennedy and Brian Okken

Python Bytes

214 Listeners

The Indicator from Planet Money by NPR

The Indicator from Planet Money

9,556 Listeners

Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

Kubernetes Podcast from Google

180 Listeners

The World in Brief from The Economist by The Economist

The World in Brief from The Economist

1,089 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

551 Listeners

Hard Fork by The New York Times

Hard Fork

5,576 Listeners

The Rest Is Money by Goalhanger

The Rest Is Money

195 Listeners

ThursdAI - The top AI news from the past week by From Weights & Biases, Join AI Evangelist Alex Volkov and a panel of experts to cover everything important that happened in the world of AI from the past week

ThursdAI - The top AI news from the past week

16 Listeners