LLM training code for MosaicML foundation models
AIMET is a library that provides advanced quantization and compression
A lightweight vision library for performing large object detection
Uplift modeling and causal inference with machine learning algorithms
FlashInfer: Kernel Library for LLM Serving
Optimizing inference proxy for LLMs
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Data manipulation and transformation for audio signal processing
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
An MLOps framework to package, deploy, monitor and manage models
Powering Amazon custom machine learning chips
A set of Docker images for training and serving models in TensorFlow
Deep learning optimization library: makes distributed training easy
Gaussian processes in TensorFlow
DoWhy is a Python library for causal inference
A library to communicate with ChatGPT, Claude, Copilot, Gemini
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Adversarial Robustness Toolbox (ART) - Python Library for ML security
The unofficial python package that returns response of Google Bard
A unified framework for scalable computing
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Libraries for applying sparsification recipes to neural networks
Sparsity-aware deep learning inference runtime for CPUs
An easy-to-use LLMs quantization package with user-friendly apis
Operating LLMs in production