Build your chatbot within minutes on your favorite device
Easiest and laziest way for building multi-agent LLMs applications
Multilingual Automatic Speech Recognition with word-level timestamps
Tensor search for humans
Pytorch domain library for recommendation systems
Bring the notion of Model-as-a-Service to life
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Unified Model Serving Framework
A set of Docker images for training and serving models in TensorFlow
Superduper: Integrate AI models and machine learning workflows
20+ high-performance LLMs with recipes to pretrain, finetune at scale
GPU environment management and cluster orchestration
Lightweight Python library for adding real-time multi-object tracking
An MLOps framework to package, deploy, monitor and manage models
Create HTML profiling reports from pandas DataFrame objects
Low-latency REST API for serving text-embeddings
A library for accelerating Transformer models on NVIDIA GPUs
Open platform for training, serving, and evaluating language models
MII makes low-latency and high-throughput inference possible
Powering Amazon custom machine learning chips
AIMET is a library that provides advanced quantization and compression
High quality, fast, modular reference implementation of SSD in PyTorch
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
Library for serving Transformers models on Amazon SageMaker