Neural Network Compression Framework for enhanced OpenVINO
Official inference library for Mistral models
AIMET is a library that provides advanced quantization and compression
Superduper: Integrate AI models and machine learning workflows
Standardized Serverless ML Inference Platform on Kubernetes
Library for serving Transformers models on Amazon SageMaker
A set of Docker images for training and serving models in TensorFlow
Powering Amazon custom machine learning chips
Library for OCR-related tasks powered by Deep Learning
A unified framework for scalable computing
Unified Model Serving Framework
An MLOps framework to package, deploy, monitor and manage models
Run Local LLMs on Any Device. Open-source
Everything you need to build state-of-the-art foundation models
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Simplifies the local serving of AI models from any source
Deep learning optimization library: makes distributed training easy
The unofficial Python package that returns responses from Google Bard
OpenMMLab Model Deployment Framework
A computer vision framework to create and deploy apps in minutes
Framework that is dedicated to making neural data processing
LLMFlows - Simple, Explicit and Transparent LLM Apps
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Toolkit for allowing inference and serving with MXNet in SageMaker
Deploy an ML inference service on a budget in 10 lines of code