Port of OpenAI's Whisper model in C/C++
LiteRT-LM is Google's production-ready inference framework
MNN is a blazing fast, lightweight deep learning framework
Low-latency AI inference engine optimized for mobile devices
Uniform deep learning inference framework for mobile
Dynamic Generalized Relevance Learning Vector Quantization