Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
20+ high-performance LLMs with recipes to pretrain and finetune at scale
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
Libraries for applying sparsification recipes to neural networks
LLM training code for MosaicML foundation models
Standardized Serverless ML Inference Platform on Kubernetes
Library for OCR-related tasks powered by Deep Learning
Optimizing inference proxy for LLMs
OpenAI-style API for open large language models
Images to inference with no labeling
Probabilistic reasoning and statistical analysis in TensorFlow
Build your chatbot within minutes on your favorite device
Easiest and laziest way to build multi-agent LLM applications
An MLOps framework to package, deploy, monitor and manage models
Easy-to-use Speech Toolkit including Self-Supervised Learning models
Open-source tool designed to enhance the efficiency of workloads
A library for accelerating Transformer models on NVIDIA GPUs
Deep learning optimization library: makes distributed training easy
A Unified Library for Parameter-Efficient Learning
A lightweight vision library for performing large object detection
State-of-the-art Parameter-Efficient Fine-Tuning
Tensor search for humans
A set of Docker images for training and serving models in TensorFlow
Simplifies the local serving of AI models from any source
Open platform for training, serving, and evaluating language models