Deep learning optimization library: makes distributed training easy
Neural Network Compression Framework for enhanced OpenVINO
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
AIMET is a library that provides advanced quantization and compression
Libraries for applying sparsification recipes to neural networks
Build your chatbot within minutes on your favorite device
Open platform for training, serving, and evaluating language models