Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
AIMET is a library that provides advanced quantization and compression
A lightweight vision library for performing large object detection
Uplift modeling and causal inference with machine learning algorithms
FlashInfer: Kernel Library for LLM Serving
Optimizing inference proxy for LLMs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Data manipulation and transformation for audio signal processing
An MLOps framework to package, deploy, monitor and manage models
Powering Amazon custom machine learning chips
A set of Docker images for training and serving models in TensorFlow
Deep learning optimization library: makes distributed training easy
Gaussian processes in TensorFlow
DoWhy is a Python library for causal inference
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Adversarial Robustness Toolbox (ART) - Python Library for ML security
The unofficial python package that returns response of Google Bard
A unified framework for scalable computing
Operating LLMs in production
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Libraries for applying sparsification recipes to neural networks
Sparsity-aware deep learning inference runtime for CPUs
An easy-to-use LLMs quantization package with user-friendly apis
Integrate, train and manage any AI models and APIs with your database