Self-contained Machine Learning and Natural Language Processing lib
High-quality, fast, modular reference implementation of SSD in PyTorch
Serve machine learning models within a Docker container
Library for serving Transformers models on Amazon SageMaker
A high-performance ML model serving framework that offers dynamic batching
Database system for building simpler and faster AI-powered applications
Serving system for machine learning models
Open platform for training, serving, and evaluating language models
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Implementation of "Tree of Thoughts
High-level Deep Learning Framework written in Kotlin
llama.go is like llama.cpp in pure Golang
A graphical manager for ollama that can manage your LLMs
LLM Chatbot Assistant for Openfire server
Guide to deploying deep-learning inference networks
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile
Deploy an ML inference service on a budget in 10 lines of code