Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Clean and efficient FP8 GEMM kernels with fine-grained scaling
An experimental version of DeepSeek model
Hackable and optimized Transformers building blocks
4M: Massively Multimodal Masked Modeling
Miso TTS is an 8 billion, highly emotive text-to-speech model
AI Suite for upscaling, interpolating & restoring images/videos
A Conversational Speech Generation Model