A library for accelerating Transformer models on NVIDIA GPUs
Deep learning optimization library: makes distributed training easy
PyTorch extensions for fast R&D prototyping and Kaggle farming
Tensor search for humans
A Pythonic framework to simplify AI service building
MII makes low-latency and high-throughput inference possible
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Library for serving Transformers models on Amazon SageMaker
AIMET is a library that provides advanced quantization and compression
Database system for building simpler and faster AI-powered application
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Single-cell analysis in Python
DoWhy is a Python library for causal inference
An Open-Source Programming Framework for Agentic AI
Uplift modeling and causal inference with machine learning algorithms
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Unified Model Serving Framework
Neural Network Compression Framework for enhanced OpenVINO
Uncover insights, surface problems, monitor, and fine tune your LLM
An MLOps framework to package, deploy, monitor and manage models
Serve machine learning models within a Docker container
Everything you need to build state-of-the-art foundation models
Bolt is a deep learning library with high performance
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
A lightweight vision library for performing large object detection