A Pythonic framework to simplify AI service building
A unified framework for scalable computing
A high-performance ML model serving framework, offers dynamic batching
Bring the notion of Model-as-a-Service to life
A library for accelerating Transformer models on NVIDIA GPUs
Unified Model Serving Framework
Powering Amazon custom machine learning chips
Implementation of "Tree of Thoughts