
Sign up to save your podcasts
Or


---
Outline:
(05:43) Experiment Overview
(05:47) Open WebText next-token prediction (randomly sampled data)
(07:27) Benchmarks (correct answer prediction)
(08:55) How does context length affect SAE performance on randomly sampled data?
(12:08) SAE Downstream Error Propagation
(14:27) How does using SAE output in place of activations affect model performance on the Lambada Benchmark?
(22:07) Takeaways
(23:28) Future work
(26:01) Code
(26:15) Acknowledgments
(27:56) Citing this post
(28:02) Appendix: Children's Book Test (CBT, Common Noun \[CN\] split)
---
First published:
Source:
Narrated by TYPE III AUDIO.
By LessWrong---
Outline:
(05:43) Experiment Overview
(05:47) Open WebText next-token prediction (randomly sampled data)
(07:27) Benchmarks (correct answer prediction)
(08:55) How does context length affect SAE performance on randomly sampled data?
(12:08) SAE Downstream Error Propagation
(14:27) How does using SAE output in place of activations affect model performance on the Lambada Benchmark?
(22:07) Takeaways
(23:28) Future work
(26:01) Code
(26:15) Acknowledgments
(27:56) Citing this post
(28:02) Appendix: Children's Book Test (CBT, Common Noun \[CN\] split)
---
First published:
Source:
Narrated by TYPE III AUDIO.

113,164 Listeners

130 Listeners

7,255 Listeners

535 Listeners

16,266 Listeners

4 Listeners

14 Listeners

2 Listeners