100–200× Acceleration for Video Diffusion Models
Taming Stable Diffusion for Lip Sync
Lets make video diffusion practical
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Expressive Portrait Image Animation for Live Streaming
Block Diffusion for Ultra-Fast Speculative Decoding
RGBD video generation model conditioned on camera input
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
The most powerful local music generation model
Stable Diffusion web UI
Deep learning framework
InvokeAI is a leading creative engine for Stable Diffusion models
Run the Stable Diffusion releases in a Docker container
Autoregressive Model Beats Diffusion
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Diffusion Transformer with Fine-Grained Chinese Understanding
Official SeedVR2 Video Upscaler for ComfyUI
Open-source multi-speaker long-form text-to-speech model
Image inpainting tool powered by SOTA AI Model
Multimodal Diffusion with Representation Alignment
A unified library of SOTA model optimization techniques
Repo for SeedVR2 & SeedVR
Towards Human-Level Text-to-Speech through Style Diffusion
HY-Motion model for 3D character animation generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model