A high-performance ML model serving framework, offers dynamic batching
A library for accelerating Transformer models on NVIDIA GPUs
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
Tensor search for humans
AIMET is a library that provides advanced quantization and compression
A lightweight vision library for performing large object detection
The unofficial python package that returns response of Google Bard
Large Language Model Text Generation Inference
Visual Instruction Tuning: Large Language-and-Vision Assistant
State-of-the-art Parameter-Efficient Fine-Tuning
Standardized Serverless ML Inference Platform on Kubernetes
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
A set of Docker images for training and serving models in TensorFlow
Single-cell analysis in Python
Gaussian processes in TensorFlow
Openai style api for open large language models
Data manipulation and transformation for audio signal processing
Open platform for training, serving, and evaluating language models
Multilingual Automatic Speech Recognition with word-level timestamps
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
DoWhy is a Python library for causal inference
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Uplift modeling and causal inference with machine learning algorithms
FlashInfer: Kernel Library for LLM Serving