A library for accelerating Transformer models on NVIDIA GPUs
A GPU-accelerated library containing highly optimized building blocks
Data manipulation and transformation for audio signal processing
Libraries for applying sparsification recipes to neural networks
OpenMLDB is an open-source machine learning database
Images to inference with no labeling
A Pythonic framework to simplify AI service building
Deep learning optimization library: makes distributed training easy
A toolkit to optimize ML models for deployment for Keras & TensorFlow
MII makes low-latency and high-throughput inference possible
AIMET is a library that provides advanced quantization and compression
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Pytorch domain library for recommendation systems
Multilingual Automatic Speech Recognition with word-level timestamps
Library for serving Transformers models on Amazon SageMaker
Bring the notion of Model-as-a-Service to life
PyTorch extensions for fast R&D prototyping and Kaggle farming
Unified Model Serving Framework
Create HTML profiling reports from pandas DataFrame objects
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
High quality, fast, modular reference implementation of SSD in PyTorch
Superduper: Integrate AI models and machine learning workflows
Database system for building simpler and faster AI-powered application
Easy-to-use Speech Toolkit including Self-Supervised Learning model