Inference framework for 1-bit LLMs
Implementation of model parallel autoregressive transformers on GPUs
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Generating Immersive, Explorable, and Interactive 3D Worlds from Words
Mirror of Ultralytics YOLO-World model weights for object detection
Speaker segmentation model for 10s audio chunks with powerset labels
Latent diffusion model for high-quality text-to-image generation
Bilingual 6.2B parameter chatbot optimized for Chinese and English
Detects speech activity in audio using pyannote.audio 2.1 pipeline