Episode 12.21: Interpretability, sparseness, and vector manipulation. Mono- and polysemanticity.
More ‘caveat emptor’ than usual in this speculative episode, which is based on three very important papers from Anthropic on interpretability, i.e. making sense of neural nets.