AIMET is a library that provides advanced quantization and compression
OpenMMLab Model Deployment Framework
Easy-to-use deep learning framework with 3 key features
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Framework for Accelerating LLM Generation with Multiple Decoding Heads
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
A lightweight vision library for performing large object detection
Powering Amazon custom machine learning chips
Single-cell analysis in Python
A library to communicate with ChatGPT, Claude, Copilot, Gemini
The official Python client for the Huggingface Hub
Uplift modeling and causal inference with machine learning algorithms
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
FlashInfer: Kernel Library for LLM Serving
Everything you need to build state-of-the-art foundation models
Easiest and laziest way for building multi-agent LLMs applications
State-of-the-art Parameter-Efficient Fine-Tuning
Trainable models and NN optimization tools
PyTorch extensions for fast R&D prototyping and Kaggle farming
Probabilistic reasoning and statistical analysis in TensorFlow
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Tensor search for humans
An MLOps framework to package, deploy, monitor and manage models
Unified Model Serving Framework