
Sign up to save your podcasts
Or
Bart, Michael and Patrick are joint first authors. Research conducted as part of MATS 6.0 in Lee Sharkey and Neel Nanda's streams. Thanks to Mckenna Fitzgerald and Robert Krzyzanowski for their feedback!
TL;DR:
---
Outline:
(02:36) Introduction
(07:41) Defining Meta-SAEs
(09:46) Meta-latents form interpretable decompositions of SAE latents
(13:04) Are Meta-Latents different from SAE Latents?
(19:56) Using Meta-SAEs to Interpret Split Features
(24:02) Causally Intervening and Making Targeted Edits with Meta-Latents
(29:03) Discussion
The original text contained 2 footnotes which were omitted from this narration.
The original text contained 2 images which were described by AI.
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
Bart, Michael and Patrick are joint first authors. Research conducted as part of MATS 6.0 in Lee Sharkey and Neel Nanda's streams. Thanks to Mckenna Fitzgerald and Robert Krzyzanowski for their feedback!
TL;DR:
---
Outline:
(02:36) Introduction
(07:41) Defining Meta-SAEs
(09:46) Meta-latents form interpretable decompositions of SAE latents
(13:04) Are Meta-Latents different from SAE Latents?
(19:56) Using Meta-SAEs to Interpret Split Features
(24:02) Causally Intervening and Making Targeted Edits with Meta-Latents
(29:03) Discussion
The original text contained 2 footnotes which were omitted from this narration.
The original text contained 2 images which were described by AI.
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
26,409 Listeners
2,387 Listeners
7,908 Listeners
4,131 Listeners
87 Listeners
1,457 Listeners
9,042 Listeners
87 Listeners
388 Listeners
5,432 Listeners
15,201 Listeners
474 Listeners
122 Listeners
75 Listeners
454 Listeners