September 05, 2023

AF - What I would do if I wasn't at ARC Evals by Lawrence Chan

21 minutes

Link to original article

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: What I would do if I wasn't at ARC Evals, published by Lawrence Chan on September 5, 2023 on The AI Alignment Forum.

In which: I list 9 projects that I would work on if I wasn't busy working on safety standards at ARC Evals, and explain why they might be good to work on.

Epistemic status: I'm prioritizing getting this out fast as opposed to writing it carefully. I've thought for at least a few hours and talked to a few people I trust about each of the following projects, but I haven't done that much digging into each of these, and it's likely that I'm wrong about many material facts. I also make little claim to the novelty of the projects. I'd recommend looking into these yourself before committing to doing them. (Total time spent writing or editing this post: ~8 hours.)

Standard disclaimer: I'm writing this in my own capacity. The views expressed are my own, and should not be taken to represent the views of ARC/FAR/LTFF/Lightspeed or any other org or program I'm involved with.

Thanks to Ajeya Cotra, Caleb Parikh, Chris Painter, Daniel Filan, Rachel Freedman, Rohin Shah, Thomas Kwa, and others for comments and feedback.

Introduction

I'm currently working as a researcher on the Alignment Research Center Evaluations Team (ARC Evals), where I'm working on lab safety standards. I'm reasonably sure that this is one of the most useful things I could be doing with my life.

Unfortunately, there's a lot of problems to solve in the world, and lots of balls that are being dropped, that I don't have time to get to thanks to my day job. Here's an unsorted and incomplete list of projects that I would consider doing if I wasn't at ARC Evals:

Ambitious mechanistic interpretability.

Getting people to write papers/writing papers myself.

Creating concrete projects and research agendas.

Working on OP's funding bottleneck.

Working on everyone else's funding bottleneck.

Running the Long-Term Future Fund.

Onboarding senior(-ish) academics and research engineers.

Extending the young-EA mentorship pipeline.

Writing blog posts/giving takes.

I've categorized these projects into three broad categories and will discuss each in turn below. For each project, I'll also list who I think should work on them, as well as some of my key uncertainties. Note that this document isn't really written for myself to decide between projects, but instead as a list of some promising projects for someone with a similar skillset to me. As such, there's not much discussion of personal fit.

If you're interested in working on any of the projects, please reach out or post in the comments below!

Relevant beliefs I have

Before jumping into the projects I think people should work on, I think it's worth outlining some of my core beliefs that inform my thinking and project selection:

Importance of A(G)I safety: I think A(G)I Safety is one of the most important problems to work on, and all the projects below are thus aimed at AI Safety.

Value beyond technical research: Technical AI Safety (AIS) research is crucial, but other types of work are valuable as well. Efforts aimed at improving AI governance, grantmaking, and community building are important and we should give more credit to those doing good work in those areas.

High discount rate for current EA/AIS funding: There's several reasons for this: first, EA/AIS Funders are currently in a unique position due to a surge in AI Safety interest without a proportional increase in funding. I expect this dynamic to change and our influence to wane as additional funding and governments enter this space. Second, efforts today are important for paving the path to future efforts in the future. Third, my timelines are relatively short, which increases the importance of current funding.

Building a robust EA/AIS ecosystem: The EA/AIS ecosystem should be more prepared for unpredictable s...

...more

View all episodes

By The Nonlinear Fund

September 05, 2023

AF - What I would do if I wasn't at ARC Evals by Lawrence Chan

21 minutes

In which: I list 9 projects that I would work on if I wasn't busy working on safety standards at ARC Evals, and explain why they might be good to work on.

Thanks to Ajeya Cotra, Caleb Parikh, Chris Painter, Daniel Filan, Rachel Freedman, Rohin Shah, Thomas Kwa, and others for comments and feedback.

Introduction

Ambitious mechanistic interpretability.

Getting people to write papers/writing papers myself.

Creating concrete projects and research agendas.

Working on OP's funding bottleneck.

Working on everyone else's funding bottleneck.

Running the Long-Term Future Fund.

Onboarding senior(-ish) academics and research engineers.

Extending the young-EA mentorship pipeline.

Writing blog posts/giving takes.

If you're interested in working on any of the projects, please reach out or post in the comments below!

Relevant beliefs I have

Before jumping into the projects I think people should work on, I think it's worth outlining some of my core beliefs that inform my thinking and project selection:

Importance of A(G)I safety: I think A(G)I Safety is one of the most important problems to work on, and all the projects below are thus aimed at AI Safety.

Building a robust EA/AIS ecosystem: The EA/AIS ecosystem should be more prepared for unpredictable s...

...more

More shows like The Nonlinear Library: Alignment Forum

View all

AXRP - the AI X-risk Research Podcast

9 Listeners

Share AF - What I would do if I wasn't at ARC Evals by Lawrence Chan

Sign up to save your podcasts

AF - What I would do if I wasn't at ARC Evals by Lawrence Chan

AF - What I would do if I wasn't at ARC Evals by Lawrence Chan

More shows like The Nonlinear Library: Alignment Forum

AXRP - the AI X-risk Research Podcast