A high-performance ML model serving framework with dynamic batching
Sparsity-aware deep learning inference runtime for CPUs
Standardized Serverless ML Inference Platform on Kubernetes
Low-latency REST API for serving text embeddings
High quality, fast, modular reference implementation of SSD in PyTorch
A computer vision framework to create and deploy apps in minutes
Lightweight anchor-free object detection model