A high-performance ML model serving framework, offers dynamic batching
Database system for building simpler and faster AI-powered application
The unofficial python package that returns response of Google Bard
Lightweight Python library for adding real-time multi-object tracking
OpenAI swift async text to image for SwiftUI app using OpenAI
Self-contained Machine Learning and Natural Language Processing lib
Serving system for machine learning models
Implementation of "Tree of Thoughts
llama.go is like llama.cpp in pure Golang
A graphical launcher for ollama that scans for installed LLMs
Guide to deploying deep-learning inference networks
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile
Deploy a ML inference service on a budget in 10 lines of code