Real-time NVIDIA GPU dashboard
How to optimize some algorithm in cuda
A cross-platform, GPU-accelerated terminal emulator
157 models, 30 providers, one command to find what runs on hardware
Modify game profiles inside the internal driver database
A tool for converting Xbox 360 shaders to HLSL
Running a big model on a small laptop
The CUDA target for Numba
A high-performance inference engine for AI models
Butterchurn is a WebGL implementation of the Milkdrop Visualizer
Python inference and LoRA trainer package for the LTX-2 audio–video
A fast cache that automatically deletes the least recently used items
A high-Performance real-time 2D plotting library based on native WebGL
Ongoing research training transformer models at scale
HeavyDB (formerly MapD/OmniSciDB)
Official mirror of libplacebo
UCCL is an efficient communication library for GPUs
Unified KV Cache Compression Methods for Auto-Regressive Models
Alibaba's high-performance LLM inference engine for diverse apps
Easily compute clip embeddings and build a clip retrieval system
GPU-accelerated GUI development for Node.js and the browser
FlashMLA: Efficient Multi-head Latent Attention Kernels
The Zoo Design Studio app
RL implementations
A game engine with an emphasis on real-time cutting-edge solutions