Browse free open source C++ Large Language Models (LLM) and projects below. Use the toggles on the left to filter open source C++ Large Language Models (LLM) by OS, license, language, programming language, and project status.
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
Mooncake is the serving platform for Kimi
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
An Easy-to-Use and High-Performance AI Deployment Framework
Emscripten: An LLVM-to-WebAssembly Compiler
TT-NN operator library, and TT-Metalium low level kernel programming
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Alibaba's high-performance LLM inference engine for diverse apps
Locally run an Instruction-Tuned Chat-Style LLM
High-speed Large Language Model Serving for Local Deployment
Implements a reference architecture for creating information systems
A @ClickHouse fork that supports high-performance vector search
Production ready toolkit to run AI locally
UCCL is an efficient communication library for GPUs
Fast Multimodal LLM on Mobile Devices
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model