C++ library for high performance inference on NVIDIA GPUs
A high-throughput and memory-efficient inference and serving engine
A RWKV management and startup tool, full automation, only 8MB
User-friendly AI Interface
Trainable models and NN optimization tools
Unified Model Serving Framework
Data manipulation and transformation for audio signal processing
Low-latency REST API for serving text-embeddings
A set of Docker images for training and serving models in TensorFlow
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Pytorch domain library for recommendation systems
Easy-to-use deep learning framework with 3 key features
A computer vision framework to create and deploy apps in minutes
Toolkit for allowing inference and serving with MXNet in SageMaker