Signals and Threads

The Uncertain Art of Accelerating ML Models with Sylvain Gugger


Listen Later

Sylvain Gugger is a former math teacher who fell into machine learning via a MOOC and became an expert in the low-level performance details of neural networks. He’s now on the ML infrastructure team at Jane Street, where he helps traders speed up their models. In this episode, Sylvain and Ron go deep on learning rate schedules; the subtle performance bugs PyTorch lets you write; how to keep a hungry GPU well-fed; and lots more, including the foremost importance of reproducibility in training runs. They also discuss some of the unique challenges of doing ML in the world of trading, like the unusual size and shape of market data and the need to do inference at shockingly low latencies.

You can find the transcript for this episode  on our website.

Some links to topics that came up in the discussion:

  • “Practical Deep Learning for Coders,” a FastAI MOOC by Jeremy Howard, and the book, of which Sylvain is a co-author.
  • The Stanford DAWNBench competition that Sylvain participated in.
  • HuggingFace, and the Accelerate library that Sylvain wrote there.
  • Some of the languages/systems for expression ML models that were discussed: PyTorch, TensorFlow, Jax, Mojo, and Triton
  • CUDA graphs and streams
  • Hogwild concurrency
...more
View all episodesView all episodes
Download on the App Store

Signals and ThreadsBy Jane Street

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

72 ratings


More shows like Signals and Threads

View all
Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

Software Engineering Radio - the podcast for professional software developers

272 Listeners

Top Traders Unplugged by Niels Kaastrup-Larsen

Top Traders Unplugged

586 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

591 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,396 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

627 Listeners

Odd Lots by Bloomberg

Odd Lots

1,768 Listeners

CoRecursive: Coding Stories by Adam Gordon Bell - Software Developer

CoRecursive: Coding Stories

189 Listeners

Flirting with Models by Corey Hoffstein

Flirting with Models

221 Listeners

Alpha Exchange by Dean Curnutt

Alpha Exchange

80 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

355 Listeners

Oxide and Friends by Oxide Computer Company

Oxide and Friends

47 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

76 Listeners

Money Stuff: The Podcast by Bloomberg

Money Stuff: The Podcast

368 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

116 Listeners

The Pragmatic Engineer by Gergely Orosz

The Pragmatic Engineer

52 Listeners