C++ library for high-performance inference on NVIDIA GPUs
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
ONNX-TensorRT: TensorRT backend for ONNX
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Serve, optimize and scale PyTorch models in production
The Triton Inference Server provides an optimized cloud
OneFlow is a deep learning framework designed to be user-friendly
CS2, Valorant, Fortnite, APEX, every game
Embed images and sentences into fixed-length vectors
Guide to deploying deep-learning inference networks
YOLOX is a high-performance anchor-free YOLO, exceeding YOLOv3 through YOLOv5
C++ library based on TensorRT integration
CPU/GPU inference server for Hugging Face transformer models