GPU environment management and cluster orchestration
Library for serving Transformers models on Amazon SageMaker
Neural Network Compression Framework for enhanced OpenVINO
Database system for building simpler and faster AI-powered application
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Toolbox of models, callbacks, and datasets for AI/ML researchers
A computer vision framework to create and deploy apps in minutes
Lightweight anchor-free object detection model
Sequence-to-sequence framework, focused on Neural Machine Translation
Implementation of model parallel autoregressive transformers on GPUs
Toolkit for allowing inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models
Deploy a ML inference service on a budget in 10 lines of code