LessWrong (Curated & Popular)

“Principles for the AGI Race ” by William_S


Listen Later

Crossposted from https://williamrsaunders.substack.com/p/principles-for-the-agi-race

Why form principles for the AGI Race?

I worked at OpenAI for 3 years, on the Alignment and Superalignment teams. Our goal was to prepare for the possibility that OpenAI succeeded in its stated mission of building AGI (Artificial General Intelligence, roughly able to do most things a human can do), and then proceed on to make systems smarter than most humans. This will predictably face novel problems in controlling and shaping systems smarter than their supervisors and creators, which we don't currently know how to solve. It's not clear when this will happen, but a number of people would throw around estimates of this happening within a few years.

While there, I would sometimes dream about what would have happened if I’d been a nuclear physicist in the 1940s. I do think that many of the kind of people who get involved in the effective [...]

---

Outline:

(00:06) Why form principles for the AGI Race?

(03:32) Bad High Risk Decisions

(04:46) Unnecessary Races to Develop Risky Technology

(05:17) High Risk Decision Principles

(05:21) Principle 1: Seek as broad and legitimate authority for your decisions as is possible under the circumstances

(07:20) Principle 2: Don’t take actions which impose significant risks to others without overwhelming evidence of net benefit

(10:52) Race Principles

(10:56) What is a Race?

(12:18) Principle 3: When racing, have an exit strategy

(13:03) Principle 4: Maintain accurate race intelligence at all times.

(14:23) Principle 5: Evaluate how bad it is for your opponent to win instead of you, and balance this against the risks of racing

(15:07) Principle 6: Seriously attempt alternatives to racing

(16:58) Meta Principles

(17:01) Principle 7: Don’t give power to people or structures that can’t be held accountable.

(18:36) Principle 8: Notice when you can’t uphold your own principles.

(19:17) Application of my Principles

(19:21) Working at OpenAI

(24:19) SB 1047

(28:32) Call to Action

---

First published:
August 30th, 2024

Source:
https://www.lesswrong.com/posts/aRciQsjgErCf5Y7D9/principles-for-the-agi-race

---

Narrated by TYPE III AUDIO.

...more
View all episodesView all episodes
Download on the App Store

LessWrong (Curated & Popular)By LessWrong

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

11 ratings


More shows like LessWrong (Curated & Popular)

View all
Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,382 Listeners

Astral Codex Ten Podcast by Jeremiah

Astral Codex Ten Podcast

122 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll | Wondery

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,079 Listeners

ManifoldOne by Steve Hsu

ManifoldOne

87 Listeners

The Jim Rutt Show by The Jim Rutt Show

The Jim Rutt Show

249 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

90 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

324 Listeners

Hard Fork by The New York Times

Hard Fork

5,368 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

137 Listeners

Razib Khan's Unsupervised Learning by Razib Khan

Razib Khan's Unsupervised Learning

200 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

103 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

64 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

138 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

101 Listeners

LessWrong posts by zvi by zvi

LessWrong posts by zvi

0 Listeners