PyTorch library of curated Transformer models and their components
State-of-the-art Parameter-Efficient Fine-Tuning
Easiest and laziest way for building multi-agent LLMs applications
Bring the notion of Model-as-a-Service to life
Tensor search for humans
MII makes low-latency and high-throughput inference possible
Pytorch domain library for recommendation systems
Official inference library for Mistral models
A set of Docker images for training and serving models in TensorFlow
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A toolkit to optimize ML models for deployment for Keras & TensorFlow
A library for accelerating Transformer models on NVIDIA GPUs
Superduper: Integrate AI models and machine learning workflows
20+ high-performance LLMs with recipes to pretrain, finetune at scale
GPU environment management and cluster orchestration
Lightweight Python library for adding real-time multi-object tracking
Create HTML profiling reports from pandas DataFrame objects
Powering Amazon custom machine learning chips
Open-Source AI Camera. Empower any camera/CCTV
Library for serving Transformers models on Amazon SageMaker
Low-latency REST API for serving text-embeddings
A GPU-accelerated library containing highly optimized building blocks
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
An MLOps framework to package, deploy, monitor and manage models