Run Local LLMs on Any Device. Open-source
A RWKV management and startup tool, full automation, only 8MB
Operating LLMs in production
The official Python client for the Huggingface Hub
A scalable inference server for models optimized with OpenVINO
A Pythonic framework to simplify AI service building
Serving system for machine learning models