Multimodal-Driven Architecture for Customized Video Generation
Qwen-Image is a powerful image generation foundation model
Official inference repo for FLUX.2 models
Personalize Any Characters with a Scalable Diffusion Transformer
A Customizable Image-to-Video Model based on HunyuanVideo
super expressive prompting model based on ltx2.3
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Dia-1.6B generates lifelike English dialogue and vocal expressions