Port of OpenAI's Whisper model in C/C++
High-performance neural network inference framework for mobile
Low-latency AI inference engine optimized for mobile devices
An Easy-to-Use and High-Performance AI Deployment Framework
QVAC Fabric: cross-platform LLM inference and fine-tuning
Deep learning inference framework optimized for mobile platforms