C++ library for high-performance inference on NVIDIA GPUs
Low-latency REST API for serving text embeddings
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
A set of Docker images for training and serving models in TensorFlow
Data manipulation and transformation for audio signal processing
Trainable models and NN optimization tools
Unified Model Serving Framework
PyTorch domain library for recommendation systems
Easy-to-use deep learning framework with 3 key features
A computer vision framework to create and deploy apps in minutes
Toolkit for inference and serving with MXNet in SageMaker