Everything you need to build state-of-the-art foundation models
Adversarial Robustness Toolbox (ART) - Python Library for ML security
State-of-the-art diffusion models for image and audio generation
Optimizing inference proxy for LLMs
A high-throughput and memory-efficient inference and serving engine
State-of-the-art Parameter-Efficient Fine-Tuning
Operating LLMs in production
Ready-to-use OCR with 80+ supported languages
Multilingual Automatic Speech Recognition with word-level timestamps
Library for OCR-related tasks powered by Deep Learning
Bring the notion of Model-as-a-Service to life
Replace OpenAI GPT with another LLM in your app
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
A Unified Library for Parameter-Efficient Learning
MII makes low-latency and high-throughput inference possible
Build your chatbot within minutes on your favorite device
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Run Local LLMs on Any Device. Open-source
FlashInfer: Kernel Library for LLM Serving
A library for accelerating Transformer models on NVIDIA GPUs
PyTorch library of curated Transformer models and their components
Simplifies the local serving of AI models from any source
Lightweight Python library for adding real-time multi-object tracking
AIMET is a library that provides advanced quantization and compression
Easiest and laziest way for building multi-agent LLMs applications