Collection of various algorithms in mathematics, machine learning
Port of Facebook's LLaMA model in C/C++
Build your own AI friend
Speech-to-text, text-to-speech, and speaker recognition
Run Local LLMs on Any Device. Open-source
LLM inference in C/C++
Distribute and run LLMs with a single file
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
OpenVINO™ Toolkit repository
Emscripten: An LLVM-to-WebAssembly Compiler
Fast Multimodal LLM on Mobile Devices
Official inference framework for 1-bit LLMs
Code for Cicero, an AI agent that plays the game of Diplomacy
High-speed Large Language Model Serving for Local Deployment
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Production ready toolkit to run AI locally
Open Source Computer Vision Library
Framework for building AI-powered interactive digital humans and agent
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Unsupervised text tokenizer for Neural Network-based text generation
Alibaba's high-performance LLM inference engine for diverse apps
Mooncake is the serving platform for Kimi
An Easy-to-Use and High-Performance AI Deployment Framework
Speech Note Linux app. Note taking, reading and translating
LiteRT-LM is Google's production-ready inference framework