OpenAI Podcast

Episode 18 - Why AI needs a new kind of supercomputer network


Listen Later

Training frontier models isn’t as simple as adding more GPUs—one small problem and the whole coordinated dance falls apart. OpenAI’s Mark Handley and Greg Steinbrecher discuss how a new supercomputer network design, used to train some of the company’s latest models, keeps the whole system moving in lockstep, even with record numbers of GPUs. They break down Multipath Reliable Connection, a new protocol OpenAI developed with AMD, Broadcom, Intel, Microsoft, and Nvidia, and why they’re making it available for the whole industry to use.


Chapters

00:00 Intro

00:39 Greg and Mark's paths to OpenAI

04:34 Why training AI stresses networks differently

10:05 Bottlenecks, failures, and the cost of waiting

15:19 How Multipath Reliable Connection works

18:59 A protocol to route around failures

25:05 Why OpenAI is making MRC an open standard

35:09 Could AI compute move to space?



Hosted on Acast. See acast.com/privacy for more information.

...more
View all episodesView all episodes
Download on the App Store

OpenAI PodcastBy OpenAI

  • 4.4
  • 4.4
  • 4.4
  • 4.4
  • 4.4

4.4

58 ratings


More shows like OpenAI Podcast

View all
The a16z Show by Andreessen Horowitz

The a16z Show

1,105 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

343 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

233 Listeners

Practical AI by Practical AI LLC

Practical AI

212 Listeners

Last Week in AI by Skynet Today

Last Week in AI

313 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

101 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

551 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

512 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

150 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

101 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

688 Listeners

The Next Wave - AI and The Future of Technology by Mindstream (Hubspot Media)

The Next Wave - AI and The Future of Technology

55 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners

How I AI by Claire Vo

How I AI

158 Listeners