
Sign up to save your podcasts
Or
We trained a SAE to find sparse features in image embeddings. We found many meaningful, interpretable, and steerable features. We find that steering image diffusion works surprisingly well and yields predictable and high-quality generations.
You can see the feature library here. We also have an intervention playground you can try.
Key Results
---
Outline:
(00:30) Key Results
(01:20) Interactive demo
(01:33) Introduction
(02:22) Steering Features
(04:22) Discovering and Interpreting Features
(06:22) Autointerpretation Labels
(07:13) Training Details
(08:34) Future Work
---
First published:
Source:
Narrated by TYPE III AUDIO.
We trained a SAE to find sparse features in image embeddings. We found many meaningful, interpretable, and steerable features. We find that steering image diffusion works surprisingly well and yields predictable and high-quality generations.
You can see the feature library here. We also have an intervention playground you can try.
Key Results
---
Outline:
(00:30) Key Results
(01:20) Interactive demo
(01:33) Introduction
(02:22) Steering Features
(04:22) Discovering and Interpreting Features
(06:22) Autointerpretation Labels
(07:13) Training Details
(08:34) Future Work
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,434 Listeners
2,388 Listeners
7,906 Listeners
4,133 Listeners
87 Listeners
1,462 Listeners
9,095 Listeners
87 Listeners
389 Listeners
5,429 Listeners
15,174 Listeners
474 Listeners
121 Listeners
75 Listeners
459 Listeners