
Sign up to save your podcasts
Or
---
Outline:
(05:43) Experiment Overview
(05:47) Open WebText next-token prediction (randomly sampled data)
(07:27) Benchmarks (correct answer prediction)
(08:55) How does context length affect SAE performance on randomly sampled data?
(12:08) SAE Downstream Error Propagation
(14:27) How does using SAE output in place of activations affect model performance on the Lambada Benchmark?
(22:07) Takeaways
(23:28) Future work
(26:01) Code
(26:15) Acknowledgments
(27:56) Citing this post
(28:02) Appendix: Children's Book Test (CBT, Common Noun \[CN\] split)
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Outline:
(05:43) Experiment Overview
(05:47) Open WebText next-token prediction (randomly sampled data)
(07:27) Benchmarks (correct answer prediction)
(08:55) How does context length affect SAE performance on randomly sampled data?
(12:08) SAE Downstream Error Propagation
(14:27) How does using SAE output in place of activations affect model performance on the Lambada Benchmark?
(22:07) Takeaways
(23:28) Future work
(26:01) Code
(26:15) Acknowledgments
(27:56) Citing this post
(28:02) Appendix: Children's Book Test (CBT, Common Noun \[CN\] split)
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,446 Listeners
2,389 Listeners
7,910 Listeners
4,136 Listeners
87 Listeners
1,462 Listeners
9,095 Listeners
87 Listeners
389 Listeners
5,438 Listeners
15,220 Listeners
475 Listeners
121 Listeners
75 Listeners
461 Listeners