Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Fast inference engine for Transformer models
MNN is a blazing fast, lightweight deep learning framework
ONNX Runtime: cross-platform, high performance ML inferencing
Deep Learning API and Server in C++14 support for Caffe, PyTorch
State-of-the-art diffusion models for image and audio generation
A Pythonic framework to simplify AI service building
OpenVINO™ Toolkit repository
A general-purpose probabilistic programming system
High-performance neural network inference framework for mobile
C++ library for high performance inference on NVIDIA GPUs
Open-Source AI Camera. Empower any camera/CCTV
LLM training code for MosaicML foundation models
Protect and discover secrets using Gitleaks
Open standard for machine learning interoperability
Serving system for machine learning models
Set of comprehensive computer vision & machine intelligence libraries
A library for accelerating Transformer models on NVIDIA GPUs
Pure C++ implementation of several models for real-time chatting
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
lightweight, standalone C++ inference engine for Google's Gemma models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Replace OpenAI GPT with another LLM in your app