June 18, 2021

Jacob Steinhardt, UC Berkeley: Machine learning safety, alignment and measurement

Listen Later

59 minutes

Jacob Steinhardt (Google Scholar) (Website) is an assistant professor at UC Berkeley. His main research interest is in designing machine learning systems that are reliable and aligned with human values. Some of his specific research directions include robustness, rewards specification and reward hacking, as well as scalable alignment.

Highlights:

📜“Test accuracy is a very limited metric.”

👨‍👩‍👧‍👦“You might not be able to get lots of feedback on human values.”

📊“I’m interested in measuring the progress in AI capabilities.”

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Generally Intelligent

By Kanjun Qiu

4.8

1616 ratings

June 18, 2021

Jacob Steinhardt, UC Berkeley: Machine learning safety, alignment and measurement

Listen Later

59 minutes

Jacob Steinhardt (Google Scholar) (Website) is an assistant professor at UC Berkeley. His main research interest is in designing machine learning systems that are reliable and aligned with human values. Some of his specific research directions include robustness, rewards specification and reward hacking, as well as scalable alignment.

Highlights:

📜“Test accuracy is a very limited metric.”

👨‍👩‍👧‍👦“You might not be able to get lots of feedback on human values.”

📊“I’m interested in measuring the progress in AI capabilities.”

...more

More shows like Generally Intelligent

The Tim Ferriss Show by Tim Ferriss: Bestselling Author, Human Guinea Pig

The Tim Ferriss Show

16,167 Listeners

The Daily by The New York Times

The Daily

112,401 Listeners

Worklife with Adam Grant by TED

Worklife with Adam Grant

9,165 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,015 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

471 Listeners