20+ high-performance LLMs with recipes to pretrain and finetune at scale
GPU environment management and cluster orchestration
A library for accelerating Transformer models on NVIDIA GPUs
Standardized Serverless ML Inference Platform on Kubernetes
A set of Docker images for training and serving models in TensorFlow
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
PyTorch library of curated Transformer models and their components
State-of-the-art Parameter-Efficient Fine-Tuning
LLM training code for MosaicML foundation models
OpenMMLab Model Deployment Framework
A toolkit to optimize Keras & TensorFlow ML models for deployment
Low-latency REST API for serving text-embeddings
An MLOps framework to package, deploy, monitor and manage models
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
Library for serving Transformers models on Amazon SageMaker
Fast inference engine for Transformer models
Deep learning optimization library: makes distributed training easy
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
High quality, fast, modular reference implementation of SSD in PyTorch
Open-Source AI Camera. Empower any camera/CCTV
AIMET is a library that provides advanced quantization and compression
Powering Amazon custom machine learning chips
A GPU-accelerated library containing highly optimized building blocks