Revolutionizing Database Interactions with Private LLM Technology
Port of Facebook's LLaMA model in C/C++
Official Python inference and LoRA trainer package
From Vibe Coding to Agentic Engineering
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Awesome multilingual OCR toolkits based on PaddlePaddle
Powerful AI language model (MoE) optimized for efficiency/performance
The most powerful local music generation model
Fast stable diffusion on CPU and AI PC
Advanced language and coding AI model
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Python bindings for llama.cpp
An easy 1-click way to create beautiful artwork on your PC using AI
Kimi K2 is the large language model series developed by Moonshot AI
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen3-TTS is an open-source series of TTS models
Phi-3.5 for Mac: Locally-run Vision and Language Models
Official inference repo for FLUX.1 models
Qwen3-Coder is the code version of Qwen3
Open-source, high-performance AI model with advanced reasoning
Agentic, Reasoning, and Coding (ARC) foundation models
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Proxy that exposes Antigravity provided claude / gemini models
Diffusion Bee is the easiest way to run Stable Diffusion locally