LessWrong (30+ Karma)

“Why Care About Natural Latents?” by johnswentworth, David Lorell


Listen Later

Suppose Alice and Bob are two Bayesian agents in the same environment. They both basically understand how their environment works, so they generally agree on predictions about any specific directly-observable thing in the world - e.g. whenever they try to operationalize a bet, they find that their odds are roughly the same. However, their two world models might have totally different internal structure, different “latent” structures which Alice and Bob model as generating the observable world around them. As a simple toy example: maybe Alice models a bunch of numbers as having been generated by independent rolls of the same biased die, and Bob models the same numbers using some big complicated neural net.

Now suppose Alice goes poking around inside of her world model, and somewhere in there she finds a latent variable _Lambda_A_ with two properties (the Natural Latent properties):

  • _Lambda_A_ approximately mediates between two different [...]

---

Outline:

(04:23) So What Could We Do With That?

(04:35) Interpretability

(06:29) Value Learning and The Pointers Problem

(08:03) Where We’re Currently Headed With This

The original text contained 1 footnote which was omitted from this narration.

---

First published:

May 9th, 2024

Source:

https://www.lesswrong.com/posts/RTiuLzusJWyepFpbN/why-care-about-natural-latents

---

Narrated by TYPE III AUDIO.

...more
View all episodesView all episodes
Download on the App Store

LessWrong (30+ Karma)By LessWrong


More shows like LessWrong (30+ Karma)

View all
The Daily by The New York Times

The Daily

113,432 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

130 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,273 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

517 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,435 Listeners

AI Article Readings by Readings of great articles in AI voices

AI Article Readings

4 Listeners

Doom Debates by Liron Shapira

Doom Debates

14 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

2 Listeners