December 22, 2024

“When AI 10x’s AI R&D, What Do We Do?” by Logan Riggs

8 minutes

Note: below is a hypothetical future written in strong terms and does not track my actual probabilities.

Throughout 2025, a huge amount of compute is spent on producing data in verifiable tasks, such as math[1] (w/ "does it compile as a proof?" being the ground truth label) and code (w/ "does it compile and pass unit tests?" being the ground truth label).

In 2026, when the next giant compute clusters w/ their GB200s are built, labs train the next larger model over 100 days, then apply some extra RL(H/AI)F and whatever else they've cooked up by then.

By mid-2026, we have a model that is very generally intelligent and superhuman in coding and math proofs.

Naively, 10x-ing research means releasing 10x as many papers of the same quality in a year; however, these new LLMs have a different skill profile, allowing different types of research and workflows.

If [...]

---

Outline:

(02:11) Scale Capabilities Safely

(02:40) Step 1: Hardening Defenses and More Control

(03:18) Step 2: Automate Interp

(07:43) Conclusion

The original text contained 3 footnotes which were omitted from this narration.

---

First published:

December 21st, 2024

Source:

https://www.lesswrong.com/posts/SvvYCH6JrLDT8iauA/when-ai-10x-s-ai-r-and-d-what-do-we-do

---

Narrated by TYPE III AUDIO.

LessWrong (30+ Karma)

By LessWrong
