Libraries for applying sparsification recipes to neural networks
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models
Neural Network Compression Framework for enhanced OpenVINO inference
PyTorch domain library for recommendation systems
MII makes low-latency and high-throughput inference possible
Tensor search for humans
Superduper: Integrate AI models and machine learning workflows
CPU/GPU inference server for Hugging Face transformer models