Run serverless GPU workloads with fast cold starts on bare-metal
Multilingual Automatic Speech Recognition with word-level timestamps
Deep Learning API and Server in C++14 support for Caffe, PyTorch
GPU environment management and cluster orchestration
A real time inference engine for temporal logical specifications