The Gradient: Perspectives on AI

Kate Park: Data Engines for Vision and Language


Listen Later

In episode 116 of The Gradient Podcast, Daniel Bashir speaks to Kate Park.

Kate is the Director of Product at Scale AI. Prior to joining Scale, Kate worked on Tesla Autopilot as the AI team’s first and lead product manager building the industry’s first data engine. She has also published research on spoken natural language processing and a travel memoir.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast:  Apple Podcasts  | Spotify | Pocket Casts | RSSFollow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:11) Kate’s background

* (03:22) Tesla and cameras vs. Lidar, importance of data

* (05:12) “Data is key”

* (07:35) Data vs. architectural improvements

* (09:36) Effort for data scaling

* (10:55) Transfer of capabilities in self-driving

* (13:44) Data flywheels and edge cases, deployment

* (15:48) Transition to Scale

* (18:52) Perspectives on shifting to transformers and data

* (21:00) Data engines for NLP vs. for vision

* (25:32) Model evaluation for LLMs in data engines

* (27:15) InstructGPT and data for RLHF

* (29:15) Benchmark tasks for assessing potential labelers

* (32:07) Biggest challenges for data engines

* (33:40) Expert AI trainers

* (36:22) Future work in data engines

* (38:25) Need for human labeling when bootstrapping new domains or tasks

* (41:05) Outro

Links:

* Scale Data Engine

* OpenAI case study



Get full access to The Gradient at thegradientpub.substack.com/subscribe
...more
View all episodesView all episodes
Download on the App Store

The Gradient: Perspectives on AIBy Daniel Bashir

  • 4.7
  • 4.7
  • 4.7
  • 4.7
  • 4.7

4.7

47 ratings


More shows like The Gradient: Perspectives on AI

View all
The Gray Area with Sean Illing by Vox

The Gray Area with Sean Illing

10,685 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

323 Listeners

Practical AI by Practical AI LLC

Practical AI

190 Listeners

Thoughts on the Market by Morgan Stanley

Thoughts on the Market

1,261 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

195 Listeners

Last Week in AI by Skynet Today

Last Week in AI

288 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,050 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

88 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

386 Listeners

Hard Fork by The New York Times

Hard Fork

5,422 Listeners

Raising Health by Andreessen Horowitz, a16z Bio + Health

Raising Health

146 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

15,220 Listeners

Unexplainable by Vox

Unexplainable

2,182 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

76 Listeners

The Ben & Marc Show by Marc Andreessen, Ben Horowitz

The Ben & Marc Show

134 Listeners