Bring the notion of Model-as-a-Service to life
An MLOps framework to package, deploy, monitor and manage models
A library for accelerating Transformer models on NVIDIA GPUs
Everything you need to build state-of-the-art foundation models
A Pythonic framework to simplify AI service building
Efficient few-shot learning with Sentence Transformers
Superduper: Integrate AI models and machine learning workflows
Easy-to-use deep learning framework with 3 key features
Framework that is dedicated to making neural data processing
FlashInfer: Kernel Library for LLM Serving
A lightweight vision library for performing large object detection
Unified Model Serving Framework
OpenMMLab Model Deployment Framework
A set of Docker images for training and serving models in TensorFlow
A high-performance ML model serving framework, offers dynamic batching
Trainable models and NN optimization tools
A unified framework for scalable computing
LLMFlows - Simple, Explicit and Transparent LLM Apps
Neural Network Compression Framework for enhanced OpenVINO
Powering Amazon custom machine learning chips
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of "Tree of Thoughts
Lightweight anchor-free object detection model
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation