🇨🇳 China AI

DeepSeek V4 Optimized for Huawei Ascend Chips With 75% Permanent Price Cut

📅 July 3, 2026 ⏱️ 9 min read ✍️ AI News Desk
DeepSeek V4 AI optimization running on Huawei Ascend processors in a Chinese data center

📌 Key Takeaways

The battle for AI supremacy in China has entered a decisive new phase. DeepSeek, one of the country's most prominent AI laboratories, has announced that its latest model — V4 — has been fully optimized to run on Huawei's Ascend processors. In a move that sent shockwaves through the semiconductor industry, the company simultaneously slashed the price of its V4-Pro tier by a staggering 75% permanently.

This dual announcement — technical alignment with Huawei hardware and aggressive pricing — represents more than a product update. It signals a structural realignment in China's AI ecosystem, one that could reshape the global competitive dynamics of artificial intelligence development for years to come.

DeepSeek V4's Huawei Ascend Optimization

For much of the AI industry's explosive growth, Nvidia's CUDA ecosystem has served as the de facto operating layer for deep learning. Training and inference workloads alike have been built around CUDA's software stack, creating a moat that competitors struggled to cross. DeepSeek's decision to optimize V4 for Ascend processors fundamentally challenges that monopoly.

The optimization effort was not a simple port. DeepSeek's engineering team reportedly rewrote significant portions of the inference kernel to leverage Ascend's matrix computation units directly. The result is a model that runs at competitive performance levels on Ascend silicon without the translation layers that typically degrade performance when deploying CUDA-native models on non-Nvidia hardware.

"DeepSeek's Ascend optimization proves that the CUDA moat is narrower than many assumed. When a frontier lab commits fully to alternative silicon, the performance gap closes faster than the market expected."

The implications extend beyond DeepSeek itself. By demonstrating that a frontier-class model can run efficiently on domestic hardware, DeepSeek has created a template that other Chinese AI labs are expected to follow. This removes one of the last major technical justifications for continued reliance on Nvidia chips in the Chinese market.

The 75% Permanent Price Cut Explained

75%
Permanent Price Cut on DeepSeek V4-Pro Tier

The pricing reduction is not a temporary promotion or limited-time offer. DeepSeek has explicitly characterized the 75% cut on the V4-Pro tier as permanent, reflecting the company's confidence in both its cost structure and the long-term viability of Ascend-based inference.

At the new price point, DeepSeek V4-Pro becomes one of the most cost-effective frontier AI models in the world. This positions it competitively against Western alternatives like GPT-4o, Claude 3.5, and Gemini Ultra — not just in raw capability, but in the economics of token consumption that matter most for enterprise deployment at scale.

The pricing strategy also carries geopolitical significance. By making V4 dramatically cheaper than comparable Western models, DeepSeek creates a powerful incentive for developers in the Global South and other price-sensitive markets to adopt Chinese AI infrastructure. This is a deliberate play for ecosystem lock-in through economic advantage.

Why is the price cut permanent? DeepSeek benefits from lower inference costs on Ascend hardware, reduced chip procurement expenses compared to Nvidia, and massive scale economies driven by China's explosive token consumption growth. The company can sustain lower margins while maintaining profitability through volume.

ByteDance's $5.6 Billion Ascend Investment

ByteDance, the parent company of TikTok and one of the world's largest consumers of AI compute, has emerged as the single largest buyer of Huawei Ascend chips. The company's 2026 procurement plan includes:

This level of commitment from ByteDance is particularly significant because the company had previously been one of the largest purchasers of Nvidia's A100 and H100 GPUs in China. The pivot to Ascend reflects both pragmatic necessity — Nvidia's export restrictions have made high-end GPU procurement increasingly unreliable — and strategic conviction that domestic alternatives have reached production readiness.

Alibaba and Tencent Join the Ascend Wave

ByteDance is not alone. Two of China's other technology giants have placed substantial orders for Ascend 950 chips:

The coordinated purchasing behavior suggests that these companies have arrived at a shared conclusion: relying on Nvidia as a primary compute provider is no longer a viable strategy given current geopolitical constraints. Huawei's Ascend platform offers the combination of domestic availability, improving performance, and government support that makes it the logical default for Chinese hyperscalers.

Ascend 950: Production Targets and Performance

The Ascend 950 is Huawei's latest and most capable AI processor, representing the culmination of years of development under U.S. sanctions pressure. Key specifications and production details include:

750K
Ascend 950 Production Target for 2026

The production target of 750,000 units represents a significant scaling effort by Huawei and its manufacturing partners. While this number still falls short of global demand for AI chips, it is sufficient to meet the near-term needs of China's largest technology companies and establishes a domestic supply chain that can grow incrementally.

China's AI Token Usage Surpasses 140 Trillion Daily

The economics of AI are increasingly driven by token throughput — the volume of text tokens processed by models in training and inference. China's AI ecosystem has reached a scale that is difficult to comprehend:

140T+
Daily AI Tokens Processed in China (March 2026)

In March 2026, China's aggregate daily AI token usage surpassed 140 trillion tokens. This figure encompasses text generation, translation, code completion, image captioning, and multimodal tasks across all major Chinese AI platforms. The number has been growing at approximately 15-20% month-over-month, driven by:

This token volume directly fuels the demand for Ascend chips. At 140 trillion tokens per day, the inference compute requirements alone justify the massive procurement budgets that ByteDance, Alibaba, and Tencent are committing to Huawei's platform.

Chinese Models Command 61% of OpenRouter Tokens

Perhaps the most striking indicator of Chinese AI's global competitive position comes from OpenRouter, one of the largest multi-model API platforms used by developers worldwide. As of mid-2026, Chinese AI models account for 61% of total token consumption on the platform.

This dominance is not limited to Chinese users. Developers in Southeast Asia, the Middle East, Africa, and Latin America increasingly choose Chinese models on OpenRouter because they offer comparable quality to Western alternatives at significantly lower cost. DeepSeek's pricing, in particular, has made it the default choice for cost-conscious developers building at scale.

The 61% figure represents a remarkable shift. Just two years ago, Western models held an overwhelming majority of OpenRouter's traffic. The speed of this transition underscores how effectively Chinese AI labs have leveraged domestic hardware advantages, aggressive pricing, and rapid iteration cycles to capture global market share.

What This Means for the Global AI Landscape

The convergence of DeepSeek's Ascend optimization, permanent price cuts, and the massive scale of China's AI ecosystem creates several profound implications:

1. The Nvidia moat is narrower than assumed. DeepSeek has demonstrated that frontier AI performance is achievable on non-CUDA hardware. As Ascend's software ecosystem matures, the technical barriers to switching will continue to decrease.

2. Price competition will intensify globally. DeepSeek's 75% permanent cut sets a new floor for AI pricing. Western AI companies will face pressure to reduce costs or risk losing developers to cheaper alternatives.

3. The geopolitical AI divide is solidifying. With Chinese companies building a self-reliant AI stack from chips to models to applications, the bifurcation of the global AI ecosystem is no longer theoretical. It is happening now.

4. Scale advantages compound quickly. At 140 trillion tokens per day, China's AI ecosystem generates vast datasets and feedback loops that improve model quality. Combined with lower costs, this creates a virtuous cycle that is difficult for competitors to break.

5. Enterprise AI adoption accelerates. When inference costs drop by 75%, use cases that were previously uneconomical become viable. Chinese enterprises are now deploying AI in scenarios that would have been cost-prohibitive just six months ago.

The bottom line: DeepSeek V4's Ascend optimization and permanent price cut are not isolated events. They are the leading edge of a structural transformation in China's AI industry — one that reduces dependence on Western technology while dramatically expanding the economic viability of AI deployment at scale.

Frequently Asked Questions

What is DeepSeek V4 and why is the Ascend optimization significant?

DeepSeek V4 is the latest frontier AI model from Chinese AI lab DeepSeek. Its significance lies in being fully optimized for Huawei Ascend processors rather than Nvidia GPUs. This marks the first time a frontier-class Chinese AI model has been purpose-built for domestic silicon, demonstrating that competitive AI performance is achievable without Nvidia's CUDA ecosystem.

How does the 75% price cut affect DeepSeek V4-Pro's competitiveness?

The 75% permanent price reduction makes DeepSeek V4-Pro one of the most affordable frontier AI models available globally. At the new pricing tier, it significantly undercuts Western alternatives like GPT-4o and Claude 3.5 on a per-token basis, making it the preferred choice for developers and enterprises building cost-sensitive AI applications at scale.

Why is ByteDance spending $5.6 billion on Huawei Ascend chips?

ByteDance's massive investment reflects both necessity and strategic conviction. U.S. export restrictions have made Nvidia GPU procurement unreliable for Chinese companies, while Huawei's Ascend 950 has reached performance levels sufficient for ByteDance's inference workloads across TikTok, Douyin, and internal LLM development. The investment ensures stable compute supply and domestic supply chain resilience.

Can Huawei Ascend 950 compete with Nvidia's latest chips?

While Ascend 950 does not match Nvidia's H100 or B200 in raw peak performance, it has become competitive for inference workloads — which represent the majority of AI compute demand. DeepSeek's optimization work proves that software tuning can close much of the performance gap. For Chinese companies facing export restrictions, Ascend offers a practical and increasingly capable alternative.

What does 140 trillion daily tokens mean for AI development in China?

The 140 trillion daily token figure indicates that AI has moved from experimental to deeply embedded in China's digital infrastructure. This scale of usage drives continuous model improvement through real-world feedback, creates massive revenue opportunities for inference providers, and justifies the billions being invested in domestic AI chip production.

Will Chinese AI models continue to gain global market share?

The trajectory suggests continued growth. Chinese models already capture 61% of token consumption on OpenRouter, driven by aggressive pricing and competitive quality. As DeepSeek and similar labs continue optimizing for Ascend hardware and driving costs lower, their economic advantage in global markets is likely to widen, particularly in price-sensitive regions.

Sources

  1. Reuters — "ByteDance plans $5.6 billion Huawei Ascend chip purchase for 2026 AI expansion" (June 2026)
  2. South China Morning Post — "Alibaba, Tencent join ByteDance in massive Ascend 950 orders" (June 2026)
  3. DeepSeek Official Blog — "V4 Model: Ascend Optimization and Pricing Update" (July 2026)
  4. Huawei Technologies — "Ascend 950 Production Targets and Ecosystem Update" (Q2 2026)
  5. OpenRouter Platform Analytics — "Model Usage Distribution Report" (Mid-2026)
  6. China Academy of Information and Communications Technology (CAICT) — "AI Token Consumption Report" (March 2026)
  7. The Information — "Chinese AI Labs Pivot to Domestic Chips as Nvidia Access Tightens" (May 2026)
  8. Nikkei Asia — "Huawei Ascend Chip Production Scaling to 750,000 Units in 2026" (June 2026)
AI

AI News Desk

Covering the intersection of artificial intelligence, semiconductors, and global technology policy. Our team tracks developments across the AI industry to deliver accurate, timely analysis for developers, investors, and decision-makers.

Stay Ahead of the AI Curve

Get weekly analysis of AI industry shifts, chip developments, and pricing trends delivered to your inbox.