OpenMMLab Model Deployment Framework
Integrate, train and manage any AI models and APIs with your database
PyTorch domain library for recommendation systems
Bring the notion of Model-as-a-Service to life
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Data manipulation and transformation for audio signal processing
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLM quantization package with user-friendly APIs
Lightweight Python library for adding real-time multi-object tracking
State-of-the-art diffusion models for image and audio generation
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Unified Model Serving Framework
A high-performance ML model serving framework that offers dynamic batching
Framework that is dedicated to making neural data processing
Optimizing inference proxy for LLMs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Images to inference with no labeling
Easy-to-use deep learning framework with 3 key features
Easiest and laziest way to build multi-agent LLM applications
Efficient few-shot learning with Sentence Transformers
Trainable models and NN optimization tools
Probabilistic reasoning and statistical analysis in TensorFlow