Port of OpenAI's Whisper model in C/C++
A Pythonic framework to simplify AI service building
A unified framework for scalable computing
A high-performance ML model serving framework, offers dynamic batching
A library for accelerating Transformer models on NVIDIA GPUs
Bring the notion of Model-as-a-Service to life
Powering Amazon custom machine learning chips
Unified Model Serving Framework
Set of comprehensive computer vision & machine intelligence libraries
Implementation of "Tree of Thoughts
llama.go is like llama.cpp in pure Golang