Serve machine learning models within a Docker container
Implementation of "Tree of Thoughts"
High-level Deep Learning Framework written in Kotlin
llama.go is like llama.cpp in pure Golang
Guide to deploying deep-learning inference networks
Toolkit for inference and serving with MXNet in SageMaker
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile
Deploy an ML inference service on a budget in 10 lines of code