Self-hosted, community-driven, local OpenAI-compatible API
Private OpenAI on Kubernetes
Zep: A long-term memory store for LLM and chatbot applications
Aqueduct allows you to run LLM and ML workloads on any infrastructure
llama.go is like llama.cpp in pure Golang
Qwen2.5-Coder is the code version of Qwen2.5, the large language model