State-of-the-art diffusion models for image and audio generation
Everything you need to build state-of-the-art foundation models
Optimizing inference proxy for LLMs
A high-throughput and memory-efficient inference and serving engine
State-of-the-art Parameter-Efficient Fine-Tuning
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Open-Source AI Camera. Empower any camera/CCTV
Operating LLMs in production
Multilingual Automatic Speech Recognition with word-level timestamps
Replace OpenAI GPT with another LLM in your app
An Open-Source Programming Framework for Agentic AI
A Unified Library for Parameter-Efficient Learning
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Bring the notion of Model-as-a-Service to life
The Triton Inference Server provides an optimized cloud
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Easy-to-use Speech Toolkit including Self-Supervised Learning model
MII makes low-latency and high-throughput inference possible
Build your chatbot within minutes on your favorite device
PyTorch library of curated Transformer models and their components
A real-time inference engine for temporal logical specifications
Open platform for training, serving, and evaluating language models
High-quality, fast, modular reference implementation of SSD in PyTorch
Database system for building simpler and faster AI-powered applications