A high-throughput and memory-efficient inference and serving engine
Unified Model Serving Framework
Trainable models and NN optimization tools
Data manipulation and transformation for audio signal processing
Low-latency REST API for serving text-embeddings
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
A set of Docker images for training and serving models in TensorFlow
Pytorch domain library for recommendation systems
A computer vision framework to create and deploy apps in minutes
Toolkit for allowing inference and serving with MXNet in SageMaker