
In our new video, we talk about research on interpreting InceptionV1, a convolutional neural network. Researchers have been able to understand the function of neurons and channels inside the network and uncover visual processing algorithms by looking at the weights. The work on InceptionV1 is early but landmark mechanistic interpretability research, and it works well as an introduction to the field. We also go into the rationale and goals of the field and mention some more recent research near the end. Our main source material is the Circuits thread in the Distill journal and this article on feature visualization. The author of the script is Arthur Frost. I have included the script below, but I recommend watching the video, since the script was written with accompanying moving visuals in mind.
Intro
In 2018, researchers trained an AI to find out if people were at [...]
---
Outline:
(00:56) Intro
(07:16) Visualisation by Optimisation
(11:09) Circuits
(15:27) Polysemanticity
(17:00) Closing thoughts, and the past few years of interpretability
---
First published:
Source:
Narrated by TYPE III AUDIO.