Compact 8B multimodal instruct model optimized for edge deployment
High-performance MoE model with MLA, MTP, and multilingual reasoning
High-compute ultra-reasoning model surpassing model surpassing GPT-5
High-efficiency reasoning and agentic intelligence model
685B model with improved agents and consistency
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
JetBrains’ 4B parameter code model for completions
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
An advanced bilingual image editing with semantic control
Vision-language-action model for robot control via images and text
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
CLIP model fine-tuned for zero-shot fashion product classification
Efficient 13B MoE language model with long context and reasoning modes
Tiny pre-trained IBM model for multivariate time series forecasting
Russian ASR model fine-tuned on Common Voice and CSS10 datasets
Frontier-scale 675B multimodal base model for custom AI training
Speculative-decoding accelerator for the 675B Mistral Large 3
Quantized 675B multimodal instruct model optimized for NVFP4
Small 3B-base multimodal model ideal for custom AI on edge hardware
Efficient 8B multimodal model tuned for advanced reasoning tasks.
High-precision 14B multimodal model built for advanced reasoning tasks
Ultra-efficient 3B multimodal instruct model built for edge deployment
Efficient 14B multimodal instruct model with edge deployment and FP8