
Sign up to save your podcasts
Or


Or: When Memories Get Good -- The Default Path Without Theoretical Breakthroughs
Epistemic status: Fairly confident in the core thesis (context + memory can substitute for weight updates for most practical purposes). The RL training loop is a sketch, not a tested proposal. I haven't done a thorough literature review.
Suppose there are no major breakthroughs in continual learning -- that is, suppose we continue to struggle at using information gathered at runtime to update the weights of a given instance of an AI model. If you try to update the weights at runtime today, usually you end up with catastrophic forgetting, or you find you can only make very small updates with the tiny amount of useful data you have
So, if you can’t train a day's worth of information into the model, how could you end up with something that functions as if it were learning on the job?
Long Context Lengths, High Quality Summaries, and Detailed Documentation
It's a straightforward idea, and basically done today, just not particularly well yet. Laying it out:
The original text contained 16 footnotes which were omitted from this narration.
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
By LessWrongOr: When Memories Get Good -- The Default Path Without Theoretical Breakthroughs
Epistemic status: Fairly confident in the core thesis (context + memory can substitute for weight updates for most practical purposes). The RL training loop is a sketch, not a tested proposal. I haven't done a thorough literature review.
Suppose there are no major breakthroughs in continual learning -- that is, suppose we continue to struggle at using information gathered at runtime to update the weights of a given instance of an AI model. If you try to update the weights at runtime today, usually you end up with catastrophic forgetting, or you find you can only make very small updates with the tiny amount of useful data you have
So, if you can’t train a day's worth of information into the model, how could you end up with something that functions as if it were learning on the job?
Long Context Lengths, High Quality Summaries, and Detailed Documentation
It's a straightforward idea, and basically done today, just not particularly well yet. Laying it out:
The original text contained 16 footnotes which were omitted from this narration.
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

112,326 Listeners

130 Listeners

7,242 Listeners

559 Listeners

16,321 Listeners

4 Listeners

14 Listeners

2 Listeners