A unified framework for scalable computing
A Pythonic framework to simplify AI service building
An MLOps framework to package, deploy, monitor and manage models
Everything you need to build state-of-the-art foundation models
A high-performance ML model serving framework, offers dynamic batching
Neural Network Compression Framework for enhanced OpenVINO
Easy-to-use deep learning framework with 3 key features
FlashInfer: Kernel Library for LLM Serving
Efficient few-shot learning with Sentence Transformers
A library for accelerating Transformer models on NVIDIA GPUs
Superduper: Integrate AI models and machine learning workflows
Framework that is dedicated to making neural data processing
Unified Model Serving Framework
Trainable models and NN optimization tools
OpenMMLab Model Deployment Framework
A set of Docker images for training and serving models in TensorFlow
Powering Amazon custom machine learning chips
A lightweight vision library for performing large object detection
LLMFlows - Simple, Explicit and Transparent LLM Apps
Framework for Accelerating LLM Generation with Multiple Decoding Heads
A computer vision framework to create and deploy apps in minutes
Implementation of "Tree of Thoughts
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
OpenMMLab Video Perception Toolbox