Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Clean and efficient FP8 GEMM kernels with fine-grained scaling
VMZ: Model Zoo for Video Modeling
FlashMLA: Efficient Multi-head Latent Attention Kernels
Real-time behaviour synthesis with MuJoCo, using Predictive Control
A fast, local neural text to speech system
Locally run an Instruction-Tuned Chat-Style LLM