October 06, 2024

“AISafety.info: What is the ‘natural abstractions hypothesis’?” by Algon

Listen Later

6 minutes

Introduction

The natural abstraction hypothesis (NAH) claims:

Our physical world “abstracts well” into high-level abstractions of low-level systems.
These abstractions are “natural” in the sense that many different kinds of learning processes acquire and use them.
These abstractions approximately correspond to the concepts used by humans.

If the NAH is true, AI alignment could be dramatically simplified, as it implies that any cognition a very powerful AI uses will be in terms of concepts that humans can understand.[1]

Explanation of the natural abstraction hypothesis

Let's unpack that definition. First, what do we mean by “our physical world abstracts well”? Just that for most things in the world, the information that describes how the thing interacts with other stuff “far away” from the system is much lower-dimensional (i.e., described by fewer numbers) than the thing itself. “Far away” can refer to many kinds of separation, including physical[2], conceptual, or [...]

---

Outline:

(00:06) Introduction

(00:45) Explanation of the natural abstraction hypothesis

(03:12) Why the natural abstraction hypothesis is important for alignment

(04:44) Is NAH true?

The original text contained 6 footnotes which were omitted from this narration.

---

First published:

October 5th, 2024

Source:

https://www.lesswrong.com/posts/m8ub3guD2cxZ4CrNv/aisafety-info-what-is-the-natural-abstractions-hypothesis

---

Narrated by TYPE III AUDIO.

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

LessWrong (30+ Karma)

By LessWrong

October 06, 2024

“AISafety.info: What is the ‘natural abstractions hypothesis’?” by Algon

Listen Later

6 minutes

Introduction

The natural abstraction hypothesis (NAH) claims:

Our physical world “abstracts well” into high-level abstractions of low-level systems.
These abstractions are “natural” in the sense that many different kinds of learning processes acquire and use them.
These abstractions approximately correspond to the concepts used by humans.

If the NAH is true, AI alignment could be dramatically simplified, as it implies that any cognition a very powerful AI uses will be in terms of concepts that humans can understand.[1]

Explanation of the natural abstraction hypothesis

Let's unpack that definition. First, what do we mean by “our physical world abstracts well”? Just that for most things in the world, the information that describes how the thing interacts with other stuff “far away” from the system is much lower-dimensional (i.e., described by fewer numbers) than the thing itself. “Far away” can refer to many kinds of separation, including physical[2], conceptual, or [...]

---

Outline:

(00:06) Introduction

(00:45) Explanation of the natural abstraction hypothesis

(03:12) Why the natural abstraction hypothesis is important for alignment

(04:44) Is NAH true?

The original text contained 6 footnotes which were omitted from this narration.

---

First published:

October 5th, 2024

Source:

https://www.lesswrong.com/posts/m8ub3guD2cxZ4CrNv/aisafety-info-what-is-the-natural-abstractions-hypothesis

---

Narrated by TYPE III AUDIO.

...more

More shows like LessWrong (30+ Karma)

The Daily by The New York Times

The Daily

112,956 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

132 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,290 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

548 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,362 Listeners

AI Article Readings by Readings of great articles in AI voices

AI Article Readings

4 Listeners

Doom Debates by Liron Shapira

Doom Debates

14 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

2 Listeners