LessWrong posts by zvi

“On DeepSeek’s r1” by Zvi


Listen Later

r1 from DeepSeek is here, the first serious challenge to OpenAI's o1.

r1 is an open model, and it comes in dramatically cheaper than o1.

People are very excited. Normally cost is not a big deal, but o1 and its inference-time compute strategy is the exception. Here, cheaper really can mean better, even if the answers aren’t quite as good.

You can get DeepSeek-r1 on HuggingFace here, and they link to the paper.

The question is how to think about r1 as it compares to o1, and also to o1 Pro and to the future o3-mini that we’ll get in a few weeks, and then to o3 which we’ll likely get in a month or two.

Taking into account everything I’ve seen, r1 is still a notch below o1 in terms of quality of output, and further behind o1 Pro and the future o3-mini [...]

---

Outline:

(01:43) Part 1: RTFP: Read the Paper

(03:38) How Did They Do It

(06:19) The Aha Moment

(08:27) Benchmarks

(09:46) Reports of Failure

(11:11) Part 2: Capabilities Analysis

(11:16) Our Price Cheap

(15:44) Other People's Benchmarks

(18:20) r1 Makes Traditional Silly Mistakes

(23:11) The Overall Vibes

(25:36) If I Could Read Your Mind

(28:06) Creative Writing

(32:21) Bring On the Spice

(34:33) We Cracked Up All the Censors

(39:44) Switching Costs Are Low In Theory

(42:15) The Self-Improvement Loop

(44:18) Room for Improvement

(48:27) Part 3: Where Does This Leave Us on Existential Risk?

(48:58) The Suicide Caucus

(51:21) v3 Implies r1

(53:09) Open Weights Are Unsafe And Nothing Can Fix This

(58:59) So What the Hell Should We Do About All This?

(01:05:53) Part 4: The Lighter Side

---

First published:

January 22nd, 2025

Source:

https://www.lesswrong.com/posts/buTWsjfwQGMvocEyw/on-deepseek-s-r1

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more
View all episodesView all episodes
Download on the App Store

LessWrong posts by zviBy zvi

  • 5
  • 5
  • 5
  • 5
  • 5

5

2 ratings


More shows like LessWrong posts by zvi

View all
Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,398 Listeners

Conversations with Tyler by Mercatus Center at George Mason University

Conversations with Tyler

2,422 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,085 Listeners

Future of Life Institute Podcast by Future of Life Institute

Future of Life Institute Podcast

107 Listeners

ChinaTalk by Jordan Schneider

ChinaTalk

289 Listeners

Politix by Politix

Politix

93 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

500 Listeners

Hard Fork by The New York Times

Hard Fork

5,466 Listeners

Clearer Thinking with Spencer Greenberg by Spencer Greenberg

Clearer Thinking with Spencer Greenberg

130 Listeners

LessWrong (Curated & Popular) by LessWrong

LessWrong (Curated & Popular)

13 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

131 Listeners

"Econ 102" with Noah Smith and Erik Torenberg by Turpentine

"Econ 102" with Noah Smith and Erik Torenberg

153 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

497 Listeners

LessWrong (30+ Karma) by LessWrong

LessWrong (30+ Karma)

0 Listeners

Complex Systems with Patrick McKenzie (patio11) by Patrick McKenzie

Complex Systems with Patrick McKenzie (patio11)

133 Listeners