China's longcat-2.0: a 1.6t parameter giant trained without nvidia \xe2\x80\x94 AlexTech

The global AI arms race has entered a new phase of geopolitical defiance. Meituan, the Chinese services giant, has released LongCat-2.0, a frontier-scale language model that challenges the long-held assumption that trillion-parameter AI requires Nvidia's ecosystem to be viable. The most striking aspect of this release is the infrastructure: LongCat-2.0 was trained entirely on Chinese-made hardware, marking a significant milestone in operational independence.

MoE Architecture and Massive Scale

LongCat-2.0 utilizes a Mixture-of-Experts (MoE) architecture, boasting a total of 1.6 trillion parameters, with approximately 48 billion activated per token. This design allows the model to maintain high-level reasoning capabilities while optimizing computational costs during inference. The training process was an engineering feat, utilizing a cluster of over 50,000 domestic AI ASICs and processing more than 35 trillion tokens.

The team behind LongCat had to overcome significant technical hurdles, including communication faults at scale, memory pressure, and numerical stability—challenges typically solved using Nvidia's proprietary software stack. By overcoming these obstacles, Meituan has proven that a fully domestic AI stack is technically viable for frontier-scale training.

Performance Analysis: Coding vs. General Reasoning

Positioned as an agentic coding model, LongCat-2.0 shows impressive results in specialized benchmarks. According to reports from The Decoder, the model outperforms Gemini 3.1 Pro and GPT-5.5 on SWE-bench Pro (59.5) and SWE-bench Multilingual (77.3), although it still trails behind Claude Opus 4.7 and 4.8.

However, the gap is wider in general reasoning tasks. On benchmarks such as IFEval (90.0), IMO-AnswerBench (81.8), and GPQA-diamond (88.9), LongCat-2.0 falls short of the top Western flagship models. Despite this, achieving competitive coding performance on non-Nvidia hardware is a clear signal that the technical barrier to entry for trillion-parameter models has lowered.

Defying US Export Controls

Since 2022, the US government has imposed strict export controls to limit China's access to high-end GPUs like the H100 and B200. LongCat-2.0 is a direct response to these restrictions. By open-sourcing the model under a permissive MIT license, Meituan is not only demonstrating technical prowess but also adopting a distribution strategy similar to Meta's Llama. This approach aims to attract global developer mindshare and accelerate the adoption of Chinese AI standards.

Global Implications for Sovereign AI

The emergence of LongCat-2.0 suggests that the gap between Chinese open-source models and Western closed systems may shrink faster than previously forecasted. As nations strive for "Sovereign AI" to avoid dependency on a few US-based providers, China's ability to build an end-to-end pipeline—from silicon to software—provides a blueprint for total technological autonomy. The era where Nvidia's dominance was the only path to frontier AI may be coming to an end.

Note: AI-Generated Content: This article was created with the support of AI tools and subsequently supervised by the site curator. There may be inaccuracies or missing updates; we recommend verifying original sources before making decisions based on the content.

MoE Architecture and Massive Scale

Performance Analysis: Coding vs. General Reasoning

Defying US Export Controls

Global Implications for Sovereign AI

Related Articles