An MLOps framework to package, deploy, monitor and manage models
Operating LLMs in production
Training and deploying machine learning models on Amazon SageMaker
OpenVINO™ Toolkit repository
On-device Speech Recognition for Apple Silicon
Replace OpenAI GPT with another LLM in your app
Low-latency REST API for serving text-embeddings
Probabilistic reasoning and statistical analysis in TensorFlow
High-performance neural network inference framework for mobile
Easiest and laziest way to build multi-agent LLM applications
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Run serverless GPU workloads with fast cold starts on bare-metal
Lightweight, standalone C++ inference engine for Google's Gemma models
Official inference library for Mistral models
Open-Source AI Camera. Empower any camera/CCTV
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Superduper: Integrate AI models and machine learning workflows
A toolkit to optimize Keras & TensorFlow ML models for deployment
Set of comprehensive computer vision & machine intelligence libraries
Integrate, train and manage any AI models and APIs with your database
LLM training code for MosaicML foundation models
A unified framework for scalable computing
Powering Amazon custom machine learning chips
An easy-to-use LLM quantization package with user-friendly APIs