A scalable inference server for models optimized with OpenVINO
VMZ: Model Zoo for Video Modeling
Port of Facebook's LLaMA model in C/C++
Port of OpenAI's Whisper model in C/C++
LLM inference in C/C++
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A lightweight 3D Morphable Face Model library in modern C++
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Testing tool for modeling GUI transitions
Open source AI model for generating full songs from lyrics prompts
Mooncake is the serving platform for Kimi
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
LiteRT, successor to TensorFlow Lite
MLX: An array framework for Apple silicon
TT-NN operator library, and TT-Metalium low level kernel programming
High-performance neural network inference framework for mobile
OCR offline image text recognition command line windows program
Alibaba's high-performance LLM inference engine for diverse apps
ONNX Runtime: cross-platform, high performance ML inferencing
Agentic browser; privacy-first alternative to ChatGPT Atlas
Distribute and run LLMs with a single file
Run Local LLMs on Any Device. Open-source
Serving system for machine learning models
OpenVINO™ Toolkit repository
Unsupervised text tokenizer for Neural Network-based text generation