Strong, Economical, and Efficient Mixture-of-Experts Language Model
Official inference repo for FLUX.2 models
Open-weight, large-scale hybrid-attention reasoning model
Official inference repo for FLUX.1 models
Safety reasoning models built-upon gpt-oss
MiniMax-M2, a model built for Max coding & agentic workflows
Bidirectional token-classification model for identifiable info
Renderer for the harmony response format to be used with gpt-oss
Towards Real-World Vision-Language Understanding
DeepSeek LLM: Let there be answers
Model that fuses instruct, reasoning and agentic skills
Open-source code agent designed for Lean 4
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Flagship MoE model for advanced reasoning, coding, and agents
OpenAI’s compact 20B open model for fast, agentic, and local use
Efficient MoE reasoning model for coding and math workloads
Dense multimodal Qwen model for coding, agents, and long context
Open multimodal model for coding, agents, and long-context tasks
Self-evolving AI model for agents, coding, and complex workflows
4-bit Command A+ model for enterprise agents and multilingual tasks
FP8 Qwen model for efficient multimodal coding and agent tasks
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
Frontier-scale 675B multimodal base model for custom AI training