A Pythonic framework to simplify AI service building
A set of Docker images for training and serving models in TensorFlow
Integrate, train and manage any AI models and APIs with your database
Pytorch domain library for recommendation systems
Open-Source AI Camera. Empower any camera/CCTV
Operating LLMs in production
Official inference library for Mistral models
Lightweight Python library for adding real-time multi-object tracking
Bring the notion of Model-as-a-Service to life
Library for OCR-related tasks powered by Deep Learning
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Open platform for training, serving, and evaluating language models
A high-performance ML model serving framework, offers dynamic batching
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Framework that is dedicated to making neural data processing
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLMs quantization package with user-friendly apis
Optimizing inference proxy for LLMs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Images to inference with no labeling
Trainable models and NN optimization tools