
Sign up to save your podcasts
Or
Audio note: this article contains 125 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.
This post depends on a basic understanding of history-based reinforcement learning and the AIXI model.
I am grateful to Marcus Hutter and the lesswrong team for early feedback, though any remaining errors are mine.
The universal agent AIXI treats the environment it interacts with like a video game it is playing; the actions it chooses at each step are like hitting buttons and the percepts it receives are like images on the screen (observations) and an unambiguous point tally (rewards). It has been suggested that since AIXI is inherently dualistic and doesn't believe anything in the environment can "directly" hurt it, if it were embedded in the real world it would eventually drop an anvil on its [...]
---
Outline:
(10:28) Formal Equivalence with an Uncorrupted AIXI
(15:16) Closing Thoughts
The original text contained 3 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
Audio note: this article contains 125 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.
This post depends on a basic understanding of history-based reinforcement learning and the AIXI model.
I am grateful to Marcus Hutter and the lesswrong team for early feedback, though any remaining errors are mine.
The universal agent AIXI treats the environment it interacts with like a video game it is playing; the actions it chooses at each step are like hitting buttons and the percepts it receives are like images on the screen (observations) and an unambiguous point tally (rewards). It has been suggested that since AIXI is inherently dualistic and doesn't believe anything in the environment can "directly" hurt it, if it were embedded in the real world it would eventually drop an anvil on its [...]
---
Outline:
(10:28) Formal Equivalence with an Uncorrupted AIXI
(15:16) Closing Thoughts
The original text contained 3 footnotes which were omitted from this narration.
---
First published:
Source:
Narrated by TYPE III AUDIO.
26,401 Listeners
2,388 Listeners
7,925 Listeners
4,132 Listeners
87 Listeners
1,456 Listeners
9,045 Listeners
86 Listeners
388 Listeners
5,427 Listeners
15,207 Listeners
474 Listeners
123 Listeners
75 Listeners
455 Listeners