Run Local LLMs on Any Device. Open-source
An MLOps framework to package, deploy, monitor and manage models
Easy-to-use deep learning framework with 3 key features
AIMET is a library that provides advanced quantization and compression
Superduper: Integrate AI models and machine learning workflows
A unified framework for scalable computing
OpenMMLab Model Deployment Framework
Deep learning optimization library: makes distributed training easy
Standardized Serverless ML Inference Platform on Kubernetes
Unified Model Serving Framework
A set of Docker images for training and serving models in TensorFlow
Neural Network Compression Framework for enhanced OpenVINO
Official inference library for Mistral models
A GPU-accelerated library containing highly optimized building blocks
Framework that is dedicated to making neural data processing
Powering Amazon custom machine learning chips
Library for serving Transformers models on Amazon SageMaker
A computer vision framework to create and deploy apps in minutes
Guide to deploying deep-learning inference networks
Toolkit for allowing inference and serving with MXNet in SageMaker
Deploy a ML inference service on a budget in 10 lines of code