Run Local LLMs on Any Device. Open-source
Standardized Serverless ML Inference Platform on Kubernetes
The Triton Inference Server provides an optimized cloud
Framework that is dedicated to making neural data processing
Library for OCR-related tasks powered by Deep Learning
An MLOps framework to package, deploy, monitor and manage models
AIMET is a library that provides advanced quantization and compression
A set of Docker images for training and serving models in TensorFlow
Official inference library for Mistral models
OpenMMLab Model Deployment Framework
Deep learning optimization library: makes distributed training easy
A unified framework for scalable computing
Neural Network Compression Framework for enhanced OpenVINO
Library for serving Transformers models on Amazon SageMaker
Unified Model Serving Framework
Superduper: Integrate AI models and machine learning workflows
Powering Amazon custom machine learning chips
A computer vision framework to create and deploy apps in minutes
OpenFieldAI is an AI based Open Field Test Rodent Tracker
Toolkit for allowing inference and serving with MXNet in SageMaker
Deploy a ML inference service on a budget in 10 lines of code