PyTorch extensions for fast R&D prototyping and Kaggle farming
Data manipulation and transformation for audio signal processing
On-device AI across mobile, embedded and edge for PyTorch
Serve, optimize and scale PyTorch models in production
ONNX Runtime: cross-platform, high performance ML inferencing
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Powering Amazon custom machine learning chips
OpenVINO™ Toolkit repository
PyTorch domain library for recommendation systems
A set of Docker images for training and serving models in TensorFlow
Libraries for applying sparsification recipes to neural networks
Neural Network Compression Framework for enhanced OpenVINO
Sparsity-aware deep learning inference runtime for CPUs
Adversarial Robustness Toolbox (ART) - Python Library for ML security
State-of-the-art diffusion models for image and audio generation
An Open-Source Programming Framework for Agentic AI
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
The Triton Inference Server provides an optimized cloud
Trainable models and NN optimization tools
Unified Model Serving Framework
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Standardized Serverless ML Inference Platform on Kubernetes
A unified framework for scalable computing
Integrate, train and manage any AI models and APIs with your database
Tensor search for humans