The official Python client for the Huggingface Hub
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Serve, optimize and scale PyTorch models in production
Lightweight Python library for adding real-time multi-object tracking
High quality, fast, modular reference implementation of SSD in PyTorch
The Triton Inference Server provides an optimized cloud
LLM Chatbot Assistant for Openfire server
High-level Deep Learning Framework written in Kotlin
OpenMMLab Video Perception Toolbox