Audience
AI coding-platform teams that need a large open MoE model for agentic coding, long-context reasoning, tool use, and complex software automation
About LongCat-2.0
LongCat-2.0 is a Mixture-of-Experts (MoE) language model with 1.6 trillion total parameters, of which about 48 billion are activated per token, and it performs strongly across coding and agentic tasks. It is a substantial step up from previous LongCat models, combining a large-scale sparse architecture with dedicated post-training for real-world software engineering, tool use, long-context reasoning, and multi-step agent workflows. The model is trained and deployed entirely on AI ASIC superpods; pretraining spans more than 35 trillion tokens and millions of accelerator-hours, demonstrating that frontier-scale training is feasible on alternative hardware platforms. To strengthen performance on long-horizon tasks, the model introduces LongCat Sparse Attention and is trained on hundreds of billions of tokens of 1M-context data, giving it native support for ultra-long-context tasks and reliable long-document understanding.
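To make the sparsity concrete, the short sketch below works out what share of the model is active for each token, using only the two parameter counts quoted above. The constant and function names are illustrative and do not come from any LongCat tooling, and the published figures are approximate, so the result is a rough guide rather than an exact specification.

```python
# Figures quoted in the description above (both approximate).
TOTAL_PARAMS = 1.6e12   # 1.6 trillion total parameters
ACTIVE_PARAMS = 48e9    # about 48 billion parameters activated per token


def active_fraction(active: float, total: float) -> float:
    """Return the share of all parameters used to process a single token."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    # Prints "active fraction: 3.0%"
    print(f"active fraction: {frac:.1%}")
```

In other words, roughly 3% of the weights take part in any one token's forward pass, which is why per-token compute tracks the 48 billion active parameters while memory capacity still has to accommodate all 1.6 trillion.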