C++ library for high performance inference on NVIDIA GPUs
ONNX Runtime: cross-platform, high performance ML inferencing
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Guide to deploying deep-learning inference networks
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile