
Sign up to save your podcasts
Or


Nick and Lily are co-first authors on this project. Lewis and Neel jointly supervised this project.
TL;DR
---
Outline:
(00:22) TL;DR
(01:48) Introduction
(04:41) Preliminaries
(06:09) Data Diffing
(07:16) Identifying known differences from datasets
(09:09) Discovering novel differences between model behavior
(14:26) Correlations
(16:21) Finding known correlations
(17:45) Finding unknown correlations
(17:58) Finding bias in internet comments
(19:52) Finding patterns in model responses
(20:51) Clustering
(22:39) Discovering known clusters
(24:26) Discovering unknown clusters
(26:13) Retrieval
(33:45) Discussion and Limitations
(35:06) Awknowledgments
The original text contained 6 footnotes which were omitted from this narration.
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
 By LessWrong
By LessWrongNick and Lily are co-first authors on this project. Lewis and Neel jointly supervised this project.
TL;DR
---
Outline:
(00:22) TL;DR
(01:48) Introduction
(04:41) Preliminaries
(06:09) Data Diffing
(07:16) Identifying known differences from datasets
(09:09) Discovering novel differences between model behavior
(14:26) Correlations
(16:21) Finding known correlations
(17:45) Finding unknown correlations
(17:58) Finding bias in internet comments
(19:52) Finding patterns in model responses
(20:51) Clustering
(22:39) Discovering known clusters
(24:26) Discovering unknown clusters
(26:13) Retrieval
(33:45) Discussion and Limitations
(35:06) Awknowledgments
The original text contained 6 footnotes which were omitted from this narration.
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

26,397 Listeners

2,423 Listeners

8,719 Listeners

4,149 Listeners

92 Listeners

1,585 Listeners

9,810 Listeners

90 Listeners

491 Listeners

5,467 Listeners

15,991 Listeners

541 Listeners

132 Listeners

95 Listeners

497 Listeners