Operating LLMs in production
An MLOps framework to package, deploy, monitor and manage models
Training and deploying machine learning models on Amazon SageMaker
Replace OpenAI GPT with another LLM in your app
Low-latency REST API for serving text-embeddings
Probabilistic reasoning and statistical analysis in TensorFlow
Easiest and laziest way for building multi-agent LLMs applications
Official inference library for Mistral models
20+ high-performance LLMs with recipes to pretrain, finetune at scale
The Triton Inference Server provides an optimized cloud
Superduper: Integrate AI models and machine learning workflows
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Powering Amazon custom machine learning chips
Integrate, train and manage any AI models and APIs with your database
LLM training code for MosaicML foundation models
A unified framework for scalable computing
An easy-to-use LLMs quantization package with user-friendly apis
Images to inference with no labeling
A computer vision framework to create and deploy apps in minutes
Training & Implementation of chatbots leveraging GPT-like architecture
CPU/GPU inference server for Hugging Face transformer models
Deploy a ML inference service on a budget in 10 lines of code