Real-time NVIDIA GPU dashboard
HunyuanVideo: A Systematic Framework For Large Video Generation Model
High-performance library for gradient boosting on decision trees
High-speed Large Language Model Serving for Local Deployment
157 models, 30 providers, one command to find what runs on hardware
Gradient boosting framework based on decision tree algorithms
How to optimize some algorithm in cuda
Faster Whisper transcription with CTranslate2
Open source machine learning framework
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
LightLLM is a Python-based LLM (Large Language Model) inference
A high-quality rapid TTS voice cloning model
Running a big model on a small laptop
Pruna is a model optimization framework built for developers
Generate audiobooks from e-books
Easily compute clip embeddings and build a clip retrieval system
Elegant and Performant Deep Learning
Sharp Monocular Metric Depth in Less Than a Second
FlashMLA: Efficient Multi-head Latent Attention Kernels
ChatGLM2-6B: An Open Bilingual Chat LLM
Unified web UI for training and running open models locally
Multi-lingual large voice generation model, providing inference
Library for OCR-related tasks powered by Deep Learning
A high-performance ML model serving framework, offers dynamic batching
A PyTorch-based Speech Toolkit