Python bindings for llama.cpp
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Powerful AI language model (MoE) optimized for efficiency/performance
Tiny vision language model
Qwen3-Coder is the code version of Qwen3
New family of code large language models (LLMs)
Phi-3.5 for Mac: Locally-run Vision and Language Models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
The official repo of Qwen chat & pretrained large language model
Open Source Speech Language Model
Qwen3 is the large language model series developed by Qwen team
Advanced language and coding AI model
Open-source, high-performance AI model with advanced reasoning
Open-source large language model family from Tencent Hunyuan
DeepSeek Coder: Let the Code Write Itself
Revolutionizing Database Interactions with Private LLM Technology
Open-Source Financial Large Language Models
Official inference repo for FLUX.1 models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Agentic, Reasoning, and Coding (ARC) foundation models
Z80-μLM is a 2-bit quantized language model