What's going on with AI

DeepSeek-V3_ A 671B parameter language model


Listen Later

Peter Dawell and Nora Kane talk about DeepSeek-V3, a large language model with 671 billion parameters developed using innovative architectures and training methods. It achieves results comparable to top-of-the-line closed systems while outperforming many open source models. The model is offered on Hugging Face and can be operated locally on various hardware platforms (including AMD and Huawei Ascend) with various frameworks. The documentation includes detailed instructions for local execution and evaluates the performance of the model against various benchmarks. Commercial use is supported.

...more
View all episodesView all episodes
Download on the App Store

What's going on with AIBy Peter Dawell, Nora Kane