June 24, 2022

Preetum Nakkiran: An Empirical Theory of Deep Learning

1 hour 37 minutes

In episode 31 of The Gradient Podcast, Daniel Bashir speaks to Preetum Nakkiran.

Preetum is a Research Scientist at Apple, a Visiting Researcher at UCSD, and part of the NSF/Simons Collaboration on the Theoretical Foundations of Deep Learning. He completed his PhD at Harvard, where he co-founded the ML Foundations Group. Preetum’s research focuses on building conceptual tools for understanding learning systems.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSSFollow The Gradient on Twitter

Sections:

(00:00) Intro

(01:25) Getting into AI through Theoretical Computer Science (TCS)

(09:08) Lack of Motivation in TCS and Learning What Research Is

(12:12) Foundational vs Problem-Solving Research, Antipatterns in TCS

(16:30) Theory and Empirics in Deep Learning

(18:30) What is an Empirical Theory of Deep Learning

(28:21) Deep Double Descent

(40:00) Inductive Biases in SGD, epoch-wise double descent

(45:25) Inductive Biases Stick Around

(47:12) Deep Bootstrap

(59:40) Distributional Generalization - Paper Rejections

(1:02:30) Classical Generalization and Distributional Generalization

(1:16:46) Future Work: Studying Structure in Data

(1:20:51) The Tweets^TM

(1:37:00) Outro

Episode Links:

* Preetum’s Homepage

* Preetum’s PhD Thesis

Get full access to The Gradient at thegradientpub.substack.com/subscribe

...more

View all episodes

By Daniel Bashir

4.7

4747 ratings

June 24, 2022

Preetum Nakkiran: An Empirical Theory of Deep Learning

1 hour 37 minutes

In episode 31 of The Gradient Podcast, Daniel Bashir speaks to Preetum Nakkiran.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSSFollow The Gradient on Twitter

Sections:

(00:00) Intro

(01:25) Getting into AI through Theoretical Computer Science (TCS)

(09:08) Lack of Motivation in TCS and Learning What Research Is

(12:12) Foundational vs Problem-Solving Research, Antipatterns in TCS

(16:30) Theory and Empirics in Deep Learning

(18:30) What is an Empirical Theory of Deep Learning

(28:21) Deep Double Descent

(40:00) Inductive Biases in SGD, epoch-wise double descent

(45:25) Inductive Biases Stick Around

(47:12) Deep Bootstrap

(59:40) Distributional Generalization - Paper Rejections

(1:02:30) Classical Generalization and Distributional Generalization

(1:16:46) Future Work: Studying Structure in Data

(1:20:51) The Tweets^TM

(1:37:00) Outro

Episode Links:

* Preetum’s Homepage

* Preetum’s PhD Thesis

Get full access to The Gradient at thegradientpub.substack.com/subscribe

...more

More shows like The Gradient: Perspectives on AI

View all

The Joe Rogan Experience

229,570 Listeners

The a16z Show

1,095 Listeners

NVIDIA AI Podcast

345 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,173 Listeners

Practical AI

215 Listeners

The Journal.

6,133 Listeners

All-In with Chamath, Jason, Sacks & Friedberg

10,192 Listeners

Dwarkesh Podcast

566 Listeners

Hard Fork

5,598 Listeners

The Rest Is History

15,784 Listeners

Huberman Lab

29,325 Listeners

Disintegrator

15 Listeners

Practical: AI & Business News

25 Listeners

Share Preetum Nakkiran: An Empirical Theory of Deep Learning

Sign up to save your podcasts

Preetum Nakkiran: An Empirical Theory of Deep Learning

Preetum Nakkiran: An Empirical Theory of Deep Learning

More shows like The Gradient: Perspectives on AI

The Joe Rogan Experience

The a16z Show

NVIDIA AI Podcast

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

Practical AI

The Journal.

All-In with Chamath, Jason, Sacks & Friedberg

Dwarkesh Podcast

Hard Fork

The Rest Is History

Huberman Lab

Disintegrator

Practical: AI & Business News