Unified multimodal Gemma model for local coding and reasoning
4-bit Command A+ model for enterprise agents and multilingual tasks
Efficient MoE model for million-token reasoning and coding
High-performance MoE model with MLA, MTP, and multilingual reasoning
High-compute ultra-reasoning model surpassing GPT-5
685B model with improved agents and consistency
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
JetBrains’ 4B-parameter code model for completions
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Tiny pre-trained IBM model for multivariate time series forecasting
Dia-1.6B generates lifelike English dialogue and vocal expressions