Port of OpenAI's Whisper model in C/C++
Uncover insights, surface problems, monitor, and fine tune your LLM
Replace OpenAI GPT with another LLM in your app
PyTorch extensions for fast R&D prototyping and Kaggle farming
MII makes low-latency and high-throughput inference possible
PyTorch library of curated Transformer models and their components
Toolkit for allowing inference and serving with MXNet in SageMaker