Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Image generation model with single-stream diffusion transformer
From Images to High-Fidelity 3D Assets
Python inference and LoRA trainer package for the LTX-2 audio–video
Qwen3.5 is the large language model series developed by Qwen team
Text and image to video generation: CogVideoX and CogVideo
Moonshot's most powerful AI model
Qwen3-TTS is an open-source series of TTS models
State-of-the-art TTS model under 25MB
Reference PyTorch implementation and models for DINOv3
Open-source multi-speaker long-form text-to-speech model
Models for object and human mesh reconstruction
A Family of Open Sourced Music Foundation Models
Qwen2.5-VL is the multimodal large language model series
A multimodal model for brain response prediction
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Qwen3 is the large language model series developed by Qwen team
The official repo of Qwen chat & pretrained large language model
LTX-Video Support for ComfyUI
Lets make video diffusion practical
A Systematic Framework for Interactive World Modeling
High-Resolution Image Synthesis with Latent Diffusion Models
An experimental version of DeepSeek model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Qwen3.6 is the large language model series developed by Qwen team