New set of lightweight state-of-the-art, open foundation models
Port of Facebook's LLaMA model in C/C++
Powerful AI language model (MoE) optimized for efficiency/performance
Advanced language and coding AI model
Python bindings for llama.cpp
Qwen3 is the large language model series developed by Qwen team
Open-source, high-performance AI model with advanced reasoning
ChatGLM-6B: An Open Bilingual Dialogue Language Model
The official repo of Qwen chat & pretrained large language model
Qwen2.5-VL is the multimodal large language model series
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Agentic, Reasoning, and Coding (ARC) foundation models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
CogView4, CogView3-Plus and CogView3(ECCV 2024)
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
DeepSeek Coder: Let the Code Write Itself
Phi-3.5 for Mac: Locally-run Vision and Language Models
Chat & pretrained large vision language model
Qwen3-Coder is the code version of Qwen3
Large-language-model & vision-language-model based on Linear Attention
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A state-of-the-art open visual language model
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Multimodal model achieving SOTA performance