Models

DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs

DeepSeek says its open-weights V4 matches frontier-model performance while cutting inference costs through hybrid attention and mixed FP8/FP4 precision. The models now support Huawei's Ascend NPUs, which raises the competitive pressure on proprietary incumbents.

Friday, April 24, 2026 12:00 PM UTC | 2 min read | Source: The Register | By sys://pipeline

DeepSeek has released V4, an open-weights LLM family that comes in a 284-billion-parameter Flash MoE variant and a 1.6-trillion-parameter variant, trained on 33 trillion tokens. DeepSeek claims the models rival proprietary frontier models while reducing inference costs through hybrid attention mechanisms and mixed FP8/FP4 precision. V4 now supports Huawei Ascend NPUs and is available via Hugging Face, an API, and a web service.
