Locally run an Instruction-Tuned Chat-Style LLM
A latent text-to-image diffusion model
A collection of high-quality models for the MuJoCo physics engine
Generate embeddings from large-scale graph-structured data
LLM providing reasoning and conversational capabilities
Speculative-decoding accelerator for the 675B Mistral Large 3
Self-evolving AI model for agents, coding, and complex workflows
Ultra-efficient 3B multimodal instruct model built for edge deployment
Compact 8B multimodal instruct model optimized for edge deployment
Efficient 8B multimodal model tuned for advanced reasoning tasks
High-precision 14B multimodal model built for advanced reasoning tasks
Efficient 14B multimodal instruct model with edge deployment and FP8
Frontier-scale 675B multimodal instruct MoE model for enterprise AI
Compact 3B-param multimodal model for efficient on-device reasoning
Lightweight on-device model for private AI text redaction
Stable fine-tuned Gemma model for structured, clear responses
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Powerful 14B LLM with strong instruction and long-text handling
Unified multimodal Gemma model for local coding and reasoning
Flagship MoE model for long-context agents and complex coding
Omnimodal AI model for agents, coding, and long-context tasks
Agentic coding model combining Opus reasoning and Fable tools
Flagship Poolside model for agentic coding and software engineering
Quantized 675B multimodal instruct model optimized for NVFP4