
Sign up to save your podcasts
Or


DeepSeek-R1-Lite-Preview was announced today. Post. Chatbot. Chinese blogpost translation.
DeepSeek says it will release the weights.
The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else.
DeepSeek is Chinese. I'm not really familiar with the company. I thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies, but I think DeepSeek does this less than other Chinese companies.
The blogpost also shows inference-time scaling, like o1:
The original text contained 2 images which were described by AI.
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
By LessWrongDeepSeek-R1-Lite-Preview was announced today. Post. Chatbot. Chinese blogpost translation.
DeepSeek says it will release the weights.
The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else.
DeepSeek is Chinese. I'm not really familiar with the company. I thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies, but I think DeepSeek does this less than other Chinese companies.
The blogpost also shows inference-time scaling, like o1:
The original text contained 2 images which were described by AI.
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

112,847 Listeners

130 Listeners

7,206 Listeners

531 Listeners

16,150 Listeners

4 Listeners

14 Listeners

2 Listeners