Unofficial (Golang) Go bindings for the Hugging Face Inference API
Large Language Model Text Generation Inference
Library for serving Transformers models on Amazon SageMaker
Openai style api for open large language models
Run 100B+ language models at home, BitTorrent-style
Implementation of "Tree of Thoughts
CPU/GPU inference server for Hugging Face transformer models