
Sign up to save your podcasts
Or
DeepSeek-R1-Lite-Preview was announced today. Post. Chatbot. Chinese blogpost translation.
DeepSeek says it will release the weights.
The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else.
DeepSeek is Chinese. I'm not really familiar with the company. I thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies, but I think DeepSeek does this less than other Chinese companies.
The blogpost also shows inference-time scaling, like o1:
The original text contained 2 images which were described by AI.
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
DeepSeek-R1-Lite-Preview was announced today. Post. Chatbot. Chinese blogpost translation.
DeepSeek says it will release the weights.
The model appears to be stronger than o1-preview on math, similar on coding, and weaker on everything else.
DeepSeek is Chinese. I'm not really familiar with the company. I thought Chinese companies were at least a year behind the frontier. Chinese companies tend to game benchmarks more than the frontier Western companies, but I think DeepSeek does this less than other Chinese companies.
The blogpost also shows inference-time scaling, like o1:
The original text contained 2 images which were described by AI.
---
First published:
Source:
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
26,362 Listeners
2,380 Listeners
7,924 Listeners
4,131 Listeners
87 Listeners
1,447 Listeners
8,922 Listeners
88 Listeners
379 Listeners
5,425 Listeners
15,206 Listeners
475 Listeners
121 Listeners
77 Listeners
455 Listeners