LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
AIMET is a library that provides advanced quantization and compression
Build your chatbot within minutes on your favorite device
OpenMMLab Model Deployment Framework