Port of OpenAI's Whisper model in C/C++
High-performance neural network inference framework for mobile
A high-performance ML model serving framework, offers dynamic batching
MNN is a blazing fast, lightweight deep learning framework
PArallel Distributed Deep LEarning: Machine Learning Framework
Easy-to-use deep learning framework with 3 key features
A library for accelerating Transformer models on NVIDIA GPUs
Bolt is a deep learning library with high performance
Build Production-ready Agentic Workflow with Natural Language
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Set of comprehensive computer vision & machine intelligence libraries
llama.go is like llama.cpp in pure Golang
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile
Fast and user-friendly runtime for transformer inference