by Elijah Feb 19,2025
DeepSeek's surprisingly inexpensive AI model challenges industry giants. The Chinese startup claims to have trained its powerful DeepSeek V3 neural network for a mere $6 million, utilizing only 2048 GPUs, significantly undercutting competitors. This seemingly low cost, however, belies a far more substantial investment.
Image: ensigame.com
DeepSeek V3's innovative architecture contributes to its efficiency. Key technologies include Multi-token Prediction (MTP), which forecasts multiple words simultaneously; Mixture of Experts (MoE), employing 256 neural networks for accelerated training; and Multi-head Latent Attention (MLA), focusing on crucial sentence elements for improved accuracy.
Image: ensigame.com
However, a SemiAnalysis report reveals a much larger infrastructure: approximately 50,000 Nvidia GPUs, valued at roughly $1.6 billion, with operational costs nearing $944 million. This contradicts DeepSeek's initial cost claim, which only accounts for pre-training GPU usage, omitting research, refinement, data processing, and overall infrastructure expenses.
DeepSeek's independence and lean structure, a subsidiary of High-Flyer hedge fund, allow for rapid innovation and efficient resource allocation. Unlike competitors reliant on cloud services, DeepSeek owns its data centers, maximizing control and speed. Its self-funded nature further enhances agility. High salaries, exceeding $1.3 million annually for some researchers, attract top talent from Chinese universities.
Image: ensigame.com
While DeepSeek's $6 million training cost figure is misleading, the company's overall investment exceeds $500 million. Its success stems from substantial investment, technological advancements, and a skilled workforce, not solely from budget efficiency. Despite this, its costs remain significantly lower than competitors; DeepSeek's R1 model cost $5 million compared to ChatGPT4's $100 million.
Image: ensigame.com
DeepSeek's example showcases a successful, well-funded independent AI company competing effectively with established players. However, the narrative of revolutionary cost-effectiveness requires a more nuanced understanding of its true investment scale.
"Clair Obscur: Expedition 33 Hits 1 Million Sales in 3 Days"
Roblox Deep Descent: January 2025 Codes Revealed
How to Feed Villagers in Necesse
Ragnarok V: Returns Beginner's Guide - Classes, Controls, Quests, Gameplay Explained
Bitlife: How to Complete the Renaissance Challenge
"Ōkami 2: Capcom, Kamiya, and Machine Head Discuss Sequel in Exclusive Interview"
Bahiti Hero Guide: Mastering the Epic Marksman in Whiteout Survival
Top 10 Liam Neeson Films Ranked
Wingspan Asia Debuts with Cross-Play Feature
Dec 16,2025
New Love and Deepspace Event: Witnessed by Deepspace
Dec 14,2025
Minecraft's Visual Refresh Rolls Out Now (45字符,控制在50字符内。关键词"Minecraft"前置,"Visual Refresh"替换原词更简洁,"Rolls Out Now"强调新闻时效性)
Dec 14,2025
Yu-Gi-Oh! Master Duel\'s Shadow Duelist revealed as Dylan Sprouse
Dec 13,2025
Pokémon Go Adds Hyper Training, Ancient Pass
Dec 13,2025