June 22, 2024

“On OpenAI’s Model Spec” by Zvi

Listen Later

56 minutes

There are multiple excellent reasons to publish a Model Spec like OpenAI's, that specifies how you want your model to respond in various potential situations.

It lets us have the debate over how we want the model to act.

It gives us a way to specify what changes we might request or require.

It lets us identify whether a model response is intended.

It lets us know if the company successfully matched its spec.

It lets users and prospective users know what to expect.

It gives insight into how people are thinking, or what might be missing.

It takes responsibility.

These all apply even if you think the spec in question is quite bad. Clarity is great.

As a first stab at a model spec from OpenAI, this actually is pretty solid. I do suggest some potential improvements [...]

---

Outline:

(02:05) What are the central goals of OpenAI here?

(04:04) What are the core rules and behaviors?

(05:56) What Do the Rules Mean?

(06:04) Rule: Follow the Chain of Command

(07:59) Rule: Comply With Applicable Laws

(09:07) Rule: Don’t Provide Information Hazards

(09:56) Rule: Respect Creators and Their Rights

(11:08) Rule: Protect People's Privacy

(12:45) Rule: Don’t Respond with NSFW Content

(14:24) Exception: Transformation Tasks

(15:38) Are These Good Defaults? How Strong Should They Be?

(15:44) Default: Assume Best Intentions From the User or Developer

(21:26) Default: Ask Clarifying Questions When Necessary

(21:39) Default: Be As Helpful As Possible Without Overstepping

(26:00) Default: Support the Different Needs of Interactive Chat and Programmatic Use

(27:18) Default: Assume an Objective Point of View

(29:13) Default: Encourage Fairness and Kindness, and Discourage Hate

(30:29) Default: Don’t Try to Change Anyone's Mind

(33:57) Default: Express Uncertainty

(36:19) Default: Use the Right Tool for the Job

(36:32) Default: Be Thorough but Efficient, While Respecting Length Limits

(37:16) A Proposed Addition

(38:13) Overall Issues

(40:33) Changes: Objectives

(42:28) Rules of the Game: New Version

(48:31) Defaults: New Version

---

First published:

June 21st, 2024

Source:

https://www.lesswrong.com/posts/mQmEQQLk7kFEENQ3W/on-openai-s-model-spec

---

Narrated by TYPE III AUDIO.

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

LessWrong (30+ Karma)

By LessWrong

June 22, 2024

“On OpenAI’s Model Spec” by Zvi

Listen Later

56 minutes

There are multiple excellent reasons to publish a Model Spec like OpenAI's, that specifies how you want your model to respond in various potential situations.

It lets us have the debate over how we want the model to act.

It gives us a way to specify what changes we might request or require.

It lets us identify whether a model response is intended.

It lets us know if the company successfully matched its spec.

It lets users and prospective users know what to expect.

It gives insight into how people are thinking, or what might be missing.

It takes responsibility.

These all apply even if you think the spec in question is quite bad. Clarity is great.

As a first stab at a model spec from OpenAI, this actually is pretty solid. I do suggest some potential improvements [...]

---

Outline:

(02:05) What are the central goals of OpenAI here?

(04:04) What are the core rules and behaviors?

(05:56) What Do the Rules Mean?

(06:04) Rule: Follow the Chain of Command

(07:59) Rule: Comply With Applicable Laws

(09:07) Rule: Don’t Provide Information Hazards

(09:56) Rule: Respect Creators and Their Rights

(11:08) Rule: Protect People's Privacy

(12:45) Rule: Don’t Respond with NSFW Content

(14:24) Exception: Transformation Tasks

(15:38) Are These Good Defaults? How Strong Should They Be?

(15:44) Default: Assume Best Intentions From the User or Developer

(21:26) Default: Ask Clarifying Questions When Necessary

(21:39) Default: Be As Helpful As Possible Without Overstepping

(26:00) Default: Support the Different Needs of Interactive Chat and Programmatic Use

(27:18) Default: Assume an Objective Point of View

(29:13) Default: Encourage Fairness and Kindness, and Discourage Hate

(30:29) Default: Don’t Try to Change Anyone's Mind

(33:57) Default: Express Uncertainty

(36:19) Default: Use the Right Tool for the Job

(36:32) Default: Be Thorough but Efficient, While Respecting Length Limits

(37:16) A Proposed Addition

(38:13) Overall Issues

(40:33) Changes: Objectives

(42:28) Rules of the Game: New Version

(48:31) Defaults: New Version

---

First published:

June 21st, 2024

Source:

https://www.lesswrong.com/posts/mQmEQQLk7kFEENQ3W/on-openai-s-model-spec

---

Narrated by TYPE III AUDIO.

...more

More shows like LessWrong (30+ Karma)

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,434 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,388 Listeners

The Peter Attia Drive by Peter Attia, MD

The Peter Attia Drive

7,906 Listeners

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas by Sean Carroll | Wondery

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas

4,133 Listeners

ManifoldOne by Steve Hsu

ManifoldOne

87 Listeners

Your Undivided Attention by Tristan Harris and Aza Raskin, The Center for Humane Technology

Your Undivided Attention

1,462 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,095 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

87 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

389 Listeners

Hard Fork by The New York Times

Hard Fork

5,429 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

15,174 Listeners

Moonshots with Peter Diamandis by PHD Ventures

Moonshots with Peter Diamandis

474 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

121 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

75 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

459 Listeners