AIMET is a library that provides advanced quantization and compression
A lightweight vision library for performing large object detection
Serve machine learning models within a Docker container
A GPU-accelerated library containing highly optimized building blocks
Uncover insights, surface problems, monitor, and fine tune your LLM
Implementation of model parallel autoregressive transformers on GPUs
Toolkit for allowing inference and serving with MXNet in SageMaker