State-of-the-art diffusion models for image and audio generation
Everything you need to build state-of-the-art foundation models
Optimizing inference proxy for LLMs
A high-throughput and memory-efficient inference and serving engine
State-of-the-art Parameter-Efficient Fine-Tuning
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Operating LLMs in production
Multilingual Automatic Speech Recognition with word-level timestamps
Replace OpenAI GPT with another LLM in your app
A Unified Library for Parameter-Efficient Learning
Bring the notion of Model-as-a-Service to life
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Easy-to-use Speech Toolkit including Self-Supervised Learning model
The Triton Inference Server provides an optimized cloud
MII makes low-latency and high-throughput inference possible
Build your chatbot within minutes on your favorite device
PyTorch library of curated Transformer models and their components
Open platform for training, serving, and evaluating language models
High quality, fast, modular reference implementation of SSD in PyTorch
Database system for building simpler and faster AI-powered application
Sequence-to-sequence framework, focused on Neural Machine Translation
OpenMMLab Video Perception Toolbox