LessWrong posts by zvi

“On OpenAI’s Preparedness Framework” by Zvi


Listen Later

Previously: On RSPs.

Be Prepared

OpenAI introduces their preparedness framework for safety in frontier models.

A summary of the biggest takeaways, which I will repeat at the end:

  1. I am very happy the preparedness framework exists at all.
  2. I am very happy it is beta and open to revision.
  3. It's very vague and needs fleshing out in several places.
  4. The framework exceeded expectations, with many great features. I updated positively.
  5. I am happy we can talk price, while noting our prices are often still far apart.
  6. Critical thresholds seem too high, if you get this wrong all could be lost. The High threshold for autonomy also seems too high.
  7. The framework relies upon honoring its spirit and not gaming the metrics.
  8. There is still a long way to go. But that is to be expected.
  9. [...]

    ---

    Outline:

    (00:07) Be Prepared

    (02:48) Basic Principles

    (07:33) Veto Power

    (10:27) Introductory Section and Risk Categories

    (13:13) Cybersecurity

    (15:58) CBRN (Chemical, Biological, Radiological and Nuclear) Threats

    (18:47) Persuasion

    (22:24) Model Autonomy

    (25:34) Key Takeaways From Risk Descriptions

    (28:36) Scorecards

    (31:27) Governance

    (34:56) Deployment Restrictions

    (36:21) Development Restrictions

    (39:50) Conclusion and Biggest Takeaways

    ---

    First published:

    December 21st, 2023

    Source:

    https://www.lesswrong.com/posts/hQPfLsDKWtdvMwyyr/on-openai-s-preparedness-framework

    ---

    Narrated by TYPE III AUDIO.

    ...more
    View all episodesView all episodes
    Download on the App Store

    LessWrong posts by zviBy zvi

    • 5
    • 5
    • 5
    • 5
    • 5

    5

    2 ratings


    More shows like LessWrong posts by zvi

    View all
    Making Sense with Sam Harris by Sam Harris

    Making Sense with Sam Harris

    26,273 Listeners

    Conversations with Tyler by Mercatus Center at George Mason University

    Conversations with Tyler

    2,452 Listeners

    The a16z Show by Andreessen Horowitz

    The a16z Show

    1,100 Listeners

    Future of Life Institute Podcast by Future of Life Institute

    Future of Life Institute Podcast

    108 Listeners

    ChinaTalk by Jordan Schneider

    ChinaTalk

    289 Listeners

    Politix by Politix

    Politix

    89 Listeners

    Dwarkesh Podcast by Dwarkesh Patel

    Dwarkesh Podcast

    558 Listeners

    Hard Fork by The New York Times

    Hard Fork

    5,551 Listeners

    Clearer Thinking with Spencer Greenberg by Spencer Greenberg

    Clearer Thinking with Spencer Greenberg

    137 Listeners

    LessWrong (Curated & Popular) by LessWrong

    LessWrong (Curated & Popular)

    12 Listeners

    No Priors: Artificial Intelligence | Technology | Startups by Conviction

    No Priors: Artificial Intelligence | Technology | Startups

    146 Listeners

    "Econ 102" with Noah Smith and Erik Torenberg by Turpentine

    "Econ 102" with Noah Smith and Erik Torenberg

    147 Listeners

    BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

    BG2Pod with Brad Gerstner and Bill Gurley

    472 Listeners

    LessWrong (30+ Karma) by LessWrong

    LessWrong (30+ Karma)

    0 Listeners

    Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

    Complex Systems with Patrick McKenzie (patio11)

    141 Listeners