Port of Facebook's LLaMA model in C/C++
Official Python inference and LoRA trainer package
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Revolutionizing Database Interactions with Private LLM Technology
From Vibe Coding to Agentic Engineering
Phi-3.5 for Mac: Locally-run Vision and Language Models
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful local music generation model
Python bindings for llama.cpp
Official inference repo for FLUX.1 models
Proxy that exposes Antigravity provided claude / gemini models
State-of-the-art TTS model under 25MB
Powerful AI language model (MoE) optimized for efficiency/performance
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen3-TTS is an open-source series of TTS models
Industrial-level controllable zero-shot text-to-speech system
Open-source, high-performance AI model with advanced reasoning
Fast stable diffusion on CPU and AI PC
MiniMax M2.1, a SOTA model for real-world dev & agents.
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Kimi K2 is the large language model series developed by Moonshot AI
Advanced language and coding AI model
Open image model at the forefront of design
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference