Run Local LLMs on Any Device. Open-source
A library for accelerating Transformer models on NVIDIA GPUs
Easy-to-use Speech Toolkit including Self-Supervised Learning model
The Triton Inference Server provides an optimized cloud
Lightweight anchor-free object detection model
Sequence-to-sequence framework, focused on Neural Machine Translation
Training & Implementation of chatbots leveraging GPT-like architecture