
Sign up to save your podcasts
Or
Audio note: this article contains 55 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.
Introduction
This is the first installment of my January writing project. We will look at generative neural networks from the framework of (probabilistic) "formal grammars", specifically focusing on building a complex grammar out of simple “rule grammars”. This turns out to lead to a nice, and relatively non-technical way of discussing how complex systems like language models can be built out of "heuristics". Thinking more about how these blocks are combined (and focussing on the difference between combining rules via "AND" vs. "OR") leads to some new insights on generalization, which are sometimes lost in the fuzzy language of heuristics and circuits. In a follow-up to this post we'll apply the tools introduced here in a more [...]
---
Outline:
(00:18) Introduction
(01:29) Grammars, subgrammars, and circuits
(03:33) Subgrammars.
(05:03) Probabilistic grammars and a tiny bit of linguistics
(10:12) Heuristics
(13:19) Rules, subgrammars and logits
(18:28) Subgrammars in mechinterp
(18:44) Modular addition: definition
(20:42) Modular addition: circuits as subgrammars
(24:24) Upshots so far
(26:04) Memorization and beyond
(31:29) Future directions
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
Audio note: this article contains 55 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.
Introduction
This is the first installment of my January writing project. We will look at generative neural networks from the framework of (probabilistic) "formal grammars", specifically focusing on building a complex grammar out of simple “rule grammars”. This turns out to lead to a nice, and relatively non-technical way of discussing how complex systems like language models can be built out of "heuristics". Thinking more about how these blocks are combined (and focussing on the difference between combining rules via "AND" vs. "OR") leads to some new insights on generalization, which are sometimes lost in the fuzzy language of heuristics and circuits. In a follow-up to this post we'll apply the tools introduced here in a more [...]
---
Outline:
(00:18) Introduction
(01:29) Grammars, subgrammars, and circuits
(03:33) Subgrammars.
(05:03) Probabilistic grammars and a tiny bit of linguistics
(10:12) Heuristics
(13:19) Rules, subgrammars and logits
(18:28) Subgrammars in mechinterp
(18:44) Modular addition: definition
(20:42) Modular addition: circuits as subgrammars
(24:24) Upshots so far
(26:04) Memorization and beyond
(31:29) Future directions
The original text contained 2 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,331 Listeners
2,396 Listeners
7,954 Listeners
4,130 Listeners
87 Listeners
1,446 Listeners
8,759 Listeners
88 Listeners
355 Listeners
5,410 Listeners
15,313 Listeners
469 Listeners
123 Listeners
76 Listeners
445 Listeners