State-of-the-art diffusion models for image and audio generation
PyTorch extensions for fast R&D prototyping and Kaggle farming
Libraries for applying sparsification recipes to neural networks
Open-Source AI Camera. Empower any camera/CCTV
A library for accelerating Transformer models on NVIDIA GPUs
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Lightweight Python library for adding real-time multi-object tracking
OpenAI-style API for open large language models
Images to inference with no labeling
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
An MLOps framework to package, deploy, monitor and manage models
Open platform for training, serving, and evaluating language models
Low-latency REST API for serving text-embeddings
Probabilistic reasoning and statistical analysis in TensorFlow
Unified Model Serving Framework
Library for serving Transformers models on Amazon SageMaker
INT4/INT5/INT8 and FP16 inference on CPU for the RWKV language model
GPU environment management and cluster orchestration
A Unified Library for Parameter-Efficient Learning
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Fast inference engine for Transformer models
State-of-the-art Parameter-Efficient Fine-Tuning
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Deep learning optimization library: makes distributed training easy