A toolkit to optimize ML models for deployment for Keras & TensorFlow
C++ library for high performance inference on NVIDIA GPUs
ONNX Runtime: cross-platform, high performance ML inferencing
Bolt is a deep learning library with high performance
Trainable models and NN optimization tools
Easy-to-use deep learning framework with 3 key features
Framework that is dedicated to making neural data processing
CPU/GPU inference server for Hugging Face transformer models