DeepSeek's new model "MODEL1" surfaces on the first anniversary of DeepSeek-R1


BlockBeats, 2026/01/21 00:01

BlockBeats news, January 21: According to Quantum Bit, a new DeepSeek model named "MODEL1" came to light on the first anniversary of DeepSeek-R1's release. DeepSeek updated the FlashMLA code on GitHub, and MODEL1 is mentioned in 28 places across 114 files, where it appears as a model distinct from V32. V32 is known to refer to DeepSeek-V3.2, so MODEL1 is likely a new architecture. In the code, the two differ in KV cache layout, sparsity handling, and FP8 decoding, along with several differences in memory optimization.


Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
