An MLOps framework to package, deploy, monitor and manage models
Trainable models and NN optimization tools
A library for accelerating Transformer models on NVIDIA GPUs
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Build your chatbot within minutes on your favorite device
Efficient few-shot learning with Sentence Transformers
Library for serving Transformers models on Amazon SageMaker
Easy-to-use Speech Toolkit including Self-Supervised Learning model
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Standardized Serverless ML Inference Platform on Kubernetes
Library for OCR-related tasks powered by Deep Learning
A Unified Library for Parameter-Efficient Learning
A unified framework for scalable computing
Superduper: Integrate AI models and machine learning workflows
A high-performance ML model serving framework, offers dynamic batching
Open platform for training, serving, and evaluating language models
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Powering Amazon custom machine learning chips
Probabilistic reasoning and statistical analysis in TensorFlow
Integrate, train and manage any AI models and APIs with your database
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
MII makes low-latency and high-throughput inference possible
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans