AIMET is a library that provides advanced quantization and compression
C++ library for high-performance inference on NVIDIA GPUs
Libraries for applying sparsification recipes to neural networks
Neural Network Compression Framework for enhanced OpenVINO
Bolt is a deep learning library with high performance
PyTorch domain library for recommendation systems
Open-Source AI Camera. Empower any camera/CCTV
Easy-to-use deep learning framework with 3 key features
Port of OpenAI's Whisper model in C/C++
Fast inference engine for Transformer models
MII makes low-latency and high-throughput inference possible
Tensor search for humans
Superduper: Integrate AI models and machine learning workflows
CPU/GPU inference server for Hugging Face transformer models