Lightweight 24B agentic coding model with vision and long context
Tencent’s 36-language state-of-the-art translation model
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Grok-2.5: Large-scale xAI model for local inference with SGLang
Reasoning-powered OCR VLM for converting complex documents to Markdown
Compact hybrid reasoning language model for intelligent responses
Flexible text-to-text transformer model for multilingual NLP tasks
T5-Small: Lightweight text-to-text transformer for NLP tasks
BGE-Large v1.5: High-accuracy English embedding model for retrieval
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Llama 3.2-1B: Multilingual, instruction-tuned model for mobile AI
Summarization model fine-tuned on CNN/DailyMail articles
CTC-based forced aligner for audio-text in 158 languages
Multimodal 7B model for image, video, and text understanding tasks
Portuguese ASR model fine-tuned from XLSR-53 for 16 kHz audio input
Efficient English embedding model for semantic search and retrieval
Powerful 14B LLM with strong instruction and long-text handling
Dia-1.6B generates lifelike English dialogue and vocal expressions
4-bit Command A+ model for enterprise agents and multilingual tasks
Flagship MoE model for long-context agents and complex coding
Omnimodal AI model for agents, coding, and long-context tasks
Flagship MoE model for advanced reasoning, coding, and agents
Efficient MoE model for million-token reasoning and coding
FP8 Qwen model for efficient multimodal coding and agent tasks
Agentic 123B coding model optimized for large-scale engineering