State-of-the-art diffusion models for image and audio generation
Everything you need to build state-of-the-art foundation models
A high-throughput and memory-efficient inference and serving engine
Optimizing inference proxy for LLMs
State-of-the-art Parameter-Efficient Fine-Tuning
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Open-Source AI Camera. Empower any camera/CCTV
Operating LLMs in production
Multilingual Automatic Speech Recognition with word-level timestamps
Replace OpenAI GPT with another LLM in your app
An Open-Source Programming Framework for Agentic AI
Bring the notion of Model-as-a-Service to life
The Triton Inference Server provides an optimized cloud
A Unified Library for Parameter-Efficient Learning
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Easy-to-use Speech Toolkit including Self-Supervised Learning model
MII makes low-latency and high-throughput inference possible
PyTorch library of curated Transformer models and their components
A real time inference engine for temporal logical specifications
High quality, fast, modular reference implementation of SSD in PyTorch
Database system for building simpler and faster AI-powered application
Sequence-to-sequence framework, focused on Neural Machine Translation
OpenMMLab Video Perception Toolbox