Port of OpenAI's Whisper model in C/C++
Fast inference engine for Transformer models
Deep Learning API and Server in C++14 support for Caffe, PyTorch
A library for accelerating Transformer models on NVIDIA GPUs
Serving system for machine learning models
Set of comprehensive computer vision & machine intelligence libraries
llama.go is like llama.cpp in pure Golang
Guide to deploying deep-learning inference networks