LongCat-2.0LongCat
|
Step 3.5 FlashStepFun
|
|||||
Related Products
|
||||||
About
LongCat-2.0 is a 1.6 trillion total-parameter Mixture-of-Experts language model built on AI ASIC superpods, with about 48 billion parameters activated per token and strong performance across coding and agentic tasks. It is a substantial step up from previous LongCat models, combining large-scale sparse architecture with dedicated post-training for real-world software engineering, tool use, long-context reasoning, and multi-step agent workflows. LongCat-2.0 is trained and deployed entirely on AI ASIC superpods, with pretraining spanning more than 35 trillion tokens and millions of accelerator-hours, demonstrating frontier-scale training on alternative hardware platforms. To strengthen long-horizon tasks, the model introduces LongCat Sparse Attention and is trained on hundreds of billions of tokens of 1M-context data, giving it native support for ultra-long context tasks and reliable long-document understanding.
|
About
Step 3.5 Flash is an advanced open source foundation language model engineered for frontier reasoning and agentic capabilities with exceptional efficiency, built on a sparse Mixture of Experts (MoE) architecture that selectively activates only about 11 billion of its ~196 billion parameters per token to deliver high-density intelligence and real-time responsiveness. Its 3-way Multi-Token Prediction (MTP-3) enables generation throughput in the hundreds of tokens per second for complex multi-step reasoning chains and task execution, and it supports efficient long contexts with a hybrid sliding window attention approach that reduces computational overhead across large datasets or codebases. It demonstrates robust performance on benchmarks for reasoning, coding, and agentic tasks, rivaling or exceeding many larger proprietary models, and includes a scalable reinforcement learning framework for consistent self-improvement.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI coding-platform teams that need a large open MoE model for agentic coding, long-context reasoning, tool use, and complex software automation
|
Audience
Developers, researchers, and AI engineers who want a powerful open source foundational AI model capable of fast, deep reasoning, coding assistance, and agentic task execution
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationLongCat
Founded: 2023
China
longcat.chat/blog/longcat-2.0/
|
Company InformationStepFun
Founded: 2023
China
static.stepfun.com/blog/step-3.5-flash/
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Claude Code
GitHub
Hermes Agent
Hugging Face
ModelScope
OpenClaw
arXiv
|
Integrations
Claude Code
GitHub
Hermes Agent
Hugging Face
ModelScope
OpenClaw
arXiv
|
|||||
|
|
|