PyTorch library of curated Transformer models and their components
State-of-the-art Parameter-Efficient Fine-Tuning
Easiest and laziest way for building multi-agent LLMs applications
Tensor search for humans
MII makes low-latency and high-throughput inference possible
Pytorch domain library for recommendation systems
A set of Docker images for training and serving models in TensorFlow
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Low-latency REST API for serving text-embeddings
20+ high-performance LLMs with recipes to pretrain, finetune at scale
GPU environment management and cluster orchestration
Lightweight Python library for adding real-time multi-object tracking
Create HTML profiling reports from pandas DataFrame objects
Fast inference engine for Transformer models
Powering Amazon custom machine learning chips
Open-Source AI Camera. Empower any camera/CCTV
An MLOps framework to package, deploy, monitor and manage models
Library for serving Transformers models on Amazon SageMaker
A library for accelerating Transformer models on NVIDIA GPUs
Superduper: Integrate AI models and machine learning workflows
A GPU-accelerated library containing highly optimized building blocks
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Integrate, train and manage any AI models and APIs with your database