C++ library for high performance inference on NVIDIA GPUs
Set of comprehensive computer vision & machine intelligence libraries
Everything you need to build state-of-the-art foundation models
The AI-native (edge and LLM) proxy for agents
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
A GPU-accelerated library containing highly optimized building blocks
Build your chatbot within minutes on your favorite device
Run serverless GPU workloads with fast cold starts on bare-metal
MNN is a blazing fast, lightweight deep learning framework
Build Production-ready Agentic Workflow with Natural Language
An MLOps framework to package, deploy, monitor and manage models
Powering Amazon custom machine learning chips
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Framework that is dedicated to making neural data processing
LLMFlows - Simple, Explicit and Transparent LLM Apps
Toolkit for allowing inference and serving with MXNet in SageMaker