Fast stable diffusion on CPU and AI PC
Text and image to video generation: CogVideoX and CogVideo
Fast-stable-diffusion + DreamBooth
Diffusion Transformer with Fine-Grained Chinese Understanding
Chinese and English multimodal conversational language model
Generating Immersive, Explorable, and Interactive 3D Worlds
High-Resolution Image Synthesis with Latent Diffusion Models
Capable of understanding text, audio, vision, video
Qwen3-omni is a natively end-to-end, omni-modal LLM
Language modeling in a sentence representation space
Easy Docker setup for Stable Diffusion with user-friendly UI
A state-of-the-art open visual language model