LessWrong (30+ Karma)

“On OpenAI’s Model Spec 2.0” by Zvi


Listen Later

OpenAI made major revisions to their Model Spec.

It seems very important to get this right, so I’m going into the weeds.

This post thus gets farther into the weeds than most people need to go. I recommend most of you read at most the sections of Part 1 that interest you, and skip Part 2.

I looked at the first version last year. I praised it as a solid first attempt.

Table of Contents

  1. Part 1
  2. Conceptual Overview.
  3. Change Log.
  4. Summary of the Key Rules.
  5. Three Goals.
  6. Three Risks.
  7. The Chain of Command.
  8. The Letter and the Spirit.
  9. Part 2
  10. Stay in Bounds: Platform Rules.
  11. The Only Developer Rule.
  12. Mental Health.
  13. What is on the Agenda.
  14. Liar Liar.
  15. Still Kind of a Liar Liar.
  16. Well, Yes [...]
  17. ---

    Outline:

    (00:30) Part 1

    (00:33) Conceptual Overview

    (05:51) Change Log

    (07:25) Summary of the Key Rules

    (11:49) Three Goals

    (15:51) Three Risks

    (20:07) The Chain of Command

    (26:14) The Letter and the Spirit

    (29:30) Part 2

    (29:33) Stay in Bounds: Platform Rules

    (47:19) The Only Developer Rule

    (49:19) Mental Health

    (50:38) What is on the Agenda

    (56:35) Liar Liar

    (01:01:56) Still Kind of a Liar Liar

    (01:07:42) Well, Yes, Okay, Sure

    (01:10:14) I Am a Good Nice Bot

    (01:20:55) A Conscious Choice

    (01:21:49) Part 3

    (01:21:52) The Super Secret Instructions

    (01:24:45) The Super Secret Model Spec Details

    (01:27:43) A Final Note

    ---

    First published:

    February 21st, 2025

    Source:

    https://www.lesswrong.com/posts/ntQYby9G8A85cEeY6/on-openai-s-model-spec-2-0

    ---

    Narrated by TYPE III AUDIO.

    ---

    Images from the article:

    Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    ...more
    View all episodesView all episodes
    Download on the App Store

    LessWrong (30+ Karma)By LessWrong


    More shows like LessWrong (30+ Karma)

    View all
    Making Sense with Sam Harris by Sam Harris

    Making Sense with Sam Harris

    26,344 Listeners

    Conversations with Tyler by Mercatus Center at George Mason University

    Conversations with Tyler

    2,444 Listeners

    The Peter Attia Drive by Peter Attia, MD

    The Peter Attia Drive

    9,132 Listeners

    Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll | Wondery

    Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

    4,153 Listeners

    ManifoldOne by Steve Hsu

    ManifoldOne

    92 Listeners

    Your Undivided Attention by The Center for Humane Technology, Tristan Harris, Daniel Barcay and Aza Raskin

    Your Undivided Attention

    1,597 Listeners

    All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

    All-In with Chamath, Jason, Sacks & Friedberg

    9,901 Listeners

    Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

    Machine Learning Street Talk (MLST)

    90 Listeners

    Dwarkesh Podcast by Dwarkesh Patel

    Dwarkesh Podcast

    505 Listeners

    Hard Fork by The New York Times

    Hard Fork

    5,473 Listeners

    The Ezra Klein Show by New York Times Opinion

    The Ezra Klein Show

    16,053 Listeners

    Moonshots with Peter Diamandis by PHD Ventures

    Moonshots with Peter Diamandis

    540 Listeners

    No Priors: Artificial Intelligence | Technology | Startups by Conviction

    No Priors: Artificial Intelligence | Technology | Startups

    132 Listeners

    Latent Space: The AI Engineer Podcast by swyx + Alessio

    Latent Space: The AI Engineer Podcast

    96 Listeners

    BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

    BG2Pod with Brad Gerstner and Bill Gurley

    517 Listeners