Python bindings for llama.cpp
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Port of Facebook's LLaMA model in C/C++
Qwen3 is the large language model series developed by Qwen team
Powerful AI language model (MoE) optimized for efficiency/performance
Qwen3-Coder is the code version of Qwen3
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Agentic, Reasoning, and Coding (ARC) foundation models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
The official repo of Qwen chat & pretrained large language model
DeepSeek Coder: Let the Code Write Itself
Phi-3.5 for Mac: Locally-run Vision and Language Models
Open-source, high-performance AI model with advanced reasoning
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open-source large language model family from Tencent Hunyuan
A state-of-the-art open visual language model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
gpt-oss-120b and gpt-oss-20b are two open-weight language models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Chinese and English multimodal conversational language model
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Towards Real-World Vision-Language Understanding