AIMET is a library that provides advanced quantization and compression techniques for trained neural network models
Standardized Serverless ML Inference Platform on Kubernetes
An MLOps framework to package, deploy, monitor and manage models
20+ high-performance LLMs with recipes to pretrain and finetune at scale
A Unified Library for Parameter-Efficient Learning
A graphical manager for Ollama that can manage your LLMs