Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Agentic, Reasoning, and Coding (ARC) foundation models
Revolutionizing Database Interactions with Private LLM Technology
Python bindings for llama.cpp
Phi-3.5 for Mac: Locally-run Vision and Language Models
Qwen3 is the large language model series developed by Qwen team
Powerful AI language model (MoE) optimized for efficiency/performance
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Open-source, high-performance AI model with advanced reasoning
Wan2.2: Open and Advanced Large-Scale Video Generative Model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
RGBD video generation model conditioned on camera input
Image generation model with single-stream diffusion transformer
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Contexts Optical Compression
State-of-the-art TTS model under 25MB
Qwen-Image is a powerful image generation foundation model
Open-weight, large-scale hybrid-attention reasoning model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
The official repo of Qwen chat & pretrained large language model
Official inference repo for FLUX.2 models