Python bindings for llama.cpp
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Agentic, Reasoning, and Coding (ARC) foundation models
Powerful AI language model (MoE) optimized for efficiency/performance
Qwen3-Coder is the code version of Qwen3
Port of Facebook's LLaMA model in C/C++
Open-source, high-performance AI model with advanced reasoning
Qwen3 is the large language model series developed by Qwen team
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Qwen-Image is a powerful image generation foundation model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Pushing the Limits of Mathematical Reasoning in Open Language Models
A Family of Open Foundation Models for Code Intelligence
CodeGeeX2: A More Powerful Multilingual Code Generation Model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Qwen3-omni is a natively end-to-end, omni-modal LLM
Tongyi Deep Research, the Leading Open-source Deep Research Agent
The official repo of Qwen chat & pretrained large language model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Phi-3.5 for Mac: Locally-run Vision and Language Models
FAIR Sequence Modeling Toolkit 2
DeepSeek Coder: Let the Code Write Itself
Renderer for the harmony response format to be used with gpt-oss