A toolkit to optimize ML models for deployment for Keras & TensorFlow
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Build your chatbot within minutes on your favorite device
Trainable models and NN optimization tools
Framework that is dedicated to making neural data processing
CPU/GPU inference server for Hugging Face transformer models