Trainable models and NN optimization tools
PyTorch extensions for fast R&D prototyping and Kaggle farming
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Unified Model Serving Framework
Images to inference with no labeling
Probabilistic reasoning and statistical analysis in TensorFlow
Pytorch domain library for recommendation systems
A lightweight vision library for performing large object detection
A toolkit to optimize ML models for deployment for Keras & TensorFlow
A high-performance ML model serving framework, offers dynamic batching
Bring the notion of Model-as-a-Service to life
Phi-3.5 for Mac: Locally-run Vision and Language Models
Superduper: Integrate AI models and machine learning workflows
A set of Docker images for training and serving models in TensorFlow
Standardized Serverless ML Inference Platform on Kubernetes
Tensor search for humans
High quality, fast, modular reference implementation of SSD in PyTorch
OpenMMLab Model Deployment Framework
Framework that is dedicated to making neural data processing
Serve machine learning models within a Docker container
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Sequence-to-sequence framework, focused on Neural Machine Translation
Toolkit for allowing inference and serving with MXNet in SageMaker