LongCat-2.0 is Meituan’s new open-model announcement for a very large mixture-of-experts model. The post claims 1.6 trillion total parameters with 48 billion active, pretraining over 35 trillion tokens, and both training and serving on huge in-house AI ASIC clusters. It also highlights architectural choices like n-gram embeddings and pitches the model as a serious large-scale system rather than a consumer-local model. That framing drove the real interest. People read this less as “here is one more chatbot” and more as evidence that a Chinese company outside the usual AI-lab shortlist may have trained and deployed a giant model on a non-Nvidia stack.
The strongest thread through the comments was that the hardware claim is the headline. Several readers inferred Huawei Ascend hardware from the blog wording and argued that, if true, the bigger milestone is not the model’s benchmark position but that a full pretrain-to-post-train pipeline may now exist on Chinese accelerators despite weaker software tooling. That turns export controls from a hard blocker into a forcing function. Others pushed the same point more broadly. Nvidia’s moat is not just chips, it is the software and operations ecosystem around them, and this launch suggests that moat is no longer untouchable in China.
At the same time, nobody treated the model claims as fully settled. The Hugging Face page had no downloadable weights yet, tool support looked rough, and several people could not tell from the materials how much was new versus how much built directly on
DeepSeek V4-Pro. One commenter backed off an initial “is this just a finetune” reaction after noticing real architectural additions, but the missing report and missing artifacts kept skepticism high. Practical testing did not rescue the launch. Early users reported mixed quality, including a niche physics question where LongCat gave a polished but wrong answer while Qwen and Gemini did better, plus the expected reports of refusals on politically sensitive prompts. The net read was clear: the infrastructure story looks more important than the model experience today, and the absence of weights keeps this closer to a claim than a proved open release.