Port of OpenAI's Whisper model in C/C++
Uncover insights, surface problems, monitor, and fine tune your LLM
A toolkit to optimize ML models for deployment for Keras & TensorFlow
AIMET is a library that provides advanced quantization and compression
A graphical manager for ollama that can manage your LLMs
Guide to deploying deep-learning inference networks
Deep learning inference framework optimized for mobile platforms