Port of Facebook's LLaMA model in C/C++
From Vibe Coding to Agentic Engineering
Models for object and human mesh reconstruction
Official Python inference and LoRA trainer package
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Awesome multilingual OCR toolkits based on PaddlePaddle
Powerful AI language model (MoE) optimized for efficiency/performance
Advanced language and coding AI model
The most powerful local music generation model
Kimi K2 is the large language model series developed by Moonshot AI
An easy 1-click way to create beautiful artwork on your PC using AI
Fast stable diffusion on CPU and AI PC
Qwen3-Coder is the code version of Qwen3
Phi-3.5 for Mac: Locally-run Vision and Language Models
Revolutionizing Database Interactions with Private LLM Technology
Code for running inference with the SAM 3D Body Model 3DB
Agentic, Reasoning, and Coding (ARC) foundation models
Qwen3-TTS is an open-source series of TTS models
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Python bindings for llama.cpp
Open-source, high-performance AI model with advanced reasoning
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Official inference repo for FLUX.1 models
Diffusion Bee is the easiest way to run Stable Diffusion locally