Port of Facebook's LLaMA model in C/C++
Official Python inference and LoRA trainer package
C++ implementation of ChatGLM-6B, ChatGLM2-6B, ChatGLM3, and GLM4(V)
Revolutionizing Database Interactions with Private LLM Technology
From Vibe Coding to Agentic Engineering
Phi-3.5 for Mac: Locally-run Vision and Language Models
Awesome multilingual OCR toolkits based on PaddlePaddle
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
The most powerful local music generation model
Python bindings for llama.cpp
Official inference repo for FLUX.1 models
Proxy that exposes Antigravity-provided Claude/Gemini models
State-of-the-art TTS model under 25MB
Powerful AI language model (MoE) optimized for efficiency/performance
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Open-source, high-performance AI model with advanced reasoning
Qwen3-TTS is an open-source series of TTS models
Industrial-level controllable zero-shot text-to-speech system
Fast Stable Diffusion on CPU and AI PC
Kimi K2 is the large language model series developed by Moonshot AI
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
MiniMax M2.1, a SOTA model for real-world dev & agents
Advanced language and coding AI model
Open image model at the forefront of design
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference