Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Official Python inference and LoRA trainer package
Revolutionizing Database Interactions with Private LLM Technology
From Vibe Coding to Agentic Engineering
Phi-3.5 for Mac: Locally-run Vision and Language Models
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Awesome multilingual OCR toolkits based on PaddlePaddle
Python bindings for llama.cpp
The most powerful local music generation model
Proxy that exposes Antigravity provided claude / gemini models
Official inference repo for FLUX.1 models
State-of-the-art TTS model under 25MB
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Powerful AI language model (MoE) optimized for efficiency/performance
Qwen3-TTS is an open-source series of TTS models
Industrial-level controllable zero-shot text-to-speech system
MiniMax M2.1, a SOTA model for real-world dev & agents.
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Fast stable diffusion on CPU and AI PC
Open-source, high-performance AI model with advanced reasoning
Open image model at the forefront of design
Kimi K2 is the large language model series developed by Moonshot AI
Advanced language and coding AI model
Visual Causal Flow