Python bindings for llama.cpp
Powerful AI language model (MoE) optimized for efficiency/performance
The official repo of Qwen chat & pretrained large language model
Open-source, high-performance AI model with advanced reasoning
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Phi-3.5 for Mac: Locally-run Vision and Language Models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open Source Speech Language Model
Open-source large language model family from Tencent Hunyuan
Revolutionizing Database Interactions with Private LLM Technology
Advanced language and coding AI model
Qwen3 is the large language model series developed by the Qwen team
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
DeepSeek Coder: Let the Code Write Itself
Official inference repo for FLUX.1 models
Agentic, Reasoning, and Coding (ARC) foundation models
Z80-μLM is a 2-bit quantized language model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Chinese and English multimodal conversational language model
Open-Source Financial Large Language Models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Large language model and vision-language model based on linear attention
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3-ASR is an open-source series of ASR models