Real-time NVIDIA GPU dashboard
HunyuanVideo: A Systematic Framework For Large Video Generation Model
High-performance library for gradient boosting on decision trees
High-speed Large Language Model Serving for Local Deployment
157 models, 30 providers, one command to find what runs on hardware
Gradient boosting framework based on decision tree algorithms
How to optimize some algorithm in cuda
Open source machine learning framework
LightLLM is a Python-based LLM (Large Language Model) inference
A high-quality rapid TTS voice cloning model
Running a big model on a small laptop
Pruna is a model optimization framework built for developers
Generate audiobooks from e-books
Easily compute clip embeddings and build a clip retrieval system
Self-supervised visual learning using momentum contrast in PyTorch
Elegant and Performant Deep Learning
Sharp Monocular Metric Depth in Less Than a Second
FlashMLA: Efficient Multi-head Latent Attention Kernels
ChatGLM2-6B: An Open Bilingual Chat LLM
Unified web UI for training and running open models locally
Multi-lingual large voice generation model, providing inference
A high-performance ML model serving framework, offers dynamic batching
A PyTorch-based Speech Toolkit
Fast ML inference & training for ONNX models in Rust
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation