LessWrong posts by zvi

“Open Problems With Claude’s Constitution” by Zvi



The first post in this series looked at the structure of Claude's Constitution.

The second post in this series looked at its ethical framework.

This final post deals with conflicts and open problems, starting with the first question one asks about any constitution. How and when will it be amended?

There are also several specific questions. How do you address claims of authority, jailbreaks and prompt injections? What about special cases like suicide risk? How do you take Anthropic's interests into account in an integrated and virtuous way? What about our jobs?

Not everyone loved the Constitution. There are twin central objections, that it either:

  1. Is absurd and isn’t necessary, you people are crazy, OR
  2. Doesn’t go far enough and how dare you, sir. Given everything here, how does Anthropic justify its actions overall?

The most important question is whether it will work, and only sometimes do you get to respond, ‘compared to what alternative?’

Post image, as chosen and imagined by Claude Opus 4.5

Amending The Constitution

The power of the United States Constitution lies in our respect for it, our willingness to put it [...]

---

Outline:

(01:30) Amending The Constitution

(03:45) Details Matter

(05:09) WASTED?

(07:40) Narrow Versus Broad

(09:00) Suicide Risk As A Special Case

(10:36) Careful, Icarus

(11:19) Beware Unreliable Sources and Prompt Injections

(12:15) Think Step By Step

(12:50) This Must Be Some Strange Use Of The Word Safe I Wasn't Previously Aware Of

(16:26) They Took Our Jobs

(20:08) One Man Cannot Serve Two Masters

(24:29) Claude's Nature

(30:14) Look What You Made Me Do

(32:32) Open Problems

(36:40) Three Reactions and Twin Objections

(36:57) Those Saying This Is Unnecessary

(38:05) Those Saying This Is Insufficient

(39:56) Those Saying This Is Unsustainable

(43:12) We Continue

---

First published:

January 28th, 2026

Source:

https://www.lesswrong.com/posts/vFAJxua3Qc6S8MbqG/open-problems-with-claude-s-constitution

---

Narrated by TYPE III AUDIO.

---

