The official Python client for the Huggingface Hub
The Triton Inference Server provides an optimized cloud
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Serve, optimize and scale PyTorch models in production
Lightweight Python library for adding real-time multi-object tracking
High quality, fast, modular reference implementation of SSD in PyTorch
High-level Deep Learning Framework written in Kotlin
LLM Chatbot Assistant for Openfire server
OpenMMLab Video Perception Toolbox