


I’ve been feeling more positive about “the idea of Anthropic” lately, as distinct from the actual company of Anthropic.
An argument for a safety-focused, science-focused commercial frontier scaling lab
I largely buy the old-school LessWrong arguments from instrumental convergence and instrumental opacity that suggest catastrophic misalignment, especially of powerful superintelligences. However, I don’t think those arguments meet the standard of evidence necessary for the world to implement approximately unprecedented policies like “establish an international treaty that puts a global moratorium on frontier AI development.” [1]
If I were king of the world, those arguments would be sufficient reason to shape the laws of my global monarchy. Specifically, I would institute a policy of approaching superintelligence much more slowly and carefully, including many separate pauses in which we thoroughly test the current models before moving forward with increasing frontier capabilities. But I’m not the king of the world, and I don’t have the affordance to implement nuanced policies that reflect the risks and uncertainties of the situation.
Given the actual governance machinery available, it seems to me that reducing our collective uncertainty about the properties of AI systems is at least helpful, and [...]
The original text contained 2 footnotes which were omitted from this narration.
---
Narrated by TYPE III AUDIO.
