Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Clean and efficient FP8 GEMM kernels with fine-grained scaling
High-Resolution Image Synthesis with Latent Diffusion Models
An experimental version of DeepSeek model
Hackable and optimized Transformers building blocks
Miso TTS is an 8 billion, highly emotive text-to-speech model
4M: Massively Multimodal Masked Modeling
A Conversational Speech Generation Model