Open multimodal model for coding, agents, and long-context tasks
Lightweight 24B agentic coding model with vision and long context
Grok-2.5: large-scale xAI model for local inference with SGLang
BGE-Large v1.5: High-accuracy English embedding model for retrieval
Summarization model fine-tuned on CNN/DailyMail articles
Multimodal 7B model for image, video, and text understanding tasks
Compact 3B-parameter multimodal model for efficient on-device reasoning
Custom BLEURT model for evaluating text similarity using PyTorch
NVFP4 DiffusionGemma model for fast multimodal text generation
Unified multimodal Gemma model for local coding and reasoning
Google’s flagship dense multimodal model for coding and reasoning
Agentic 123B coding model optimized for large-scale engineering
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Efficient 13B MoE language model with long context and reasoning modes
Powerful 14B multimodal base model, a flexible starting point for fine-tuning
Agentic coding model combining Opus reasoning and Fable tools
Flagship Poolside model for agentic coding and software engineering
Speculative-decoding accelerator for the 675B Mistral Large 3
Quantized 675B multimodal instruct model optimized for NVFP4
Efficient 14B multimodal instruct model for edge deployment, with FP8 support
Open, non-commercial SDXL model for high-quality image generation
Lightweight multimodal translation model for 55 languages
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Compact English sentence embedding model for semantic search tasks
QwQ-32B is a reasoning-focused language model for complex tasks