Open-source multi-speaker long-form text-to-speech model
Qwen3-TTS is an open-source series of TTS models
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Text and image to video generation: CogVideoX and CogVideo
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen3.5 is the large language model series developed by Qwen team
State-of-the-art TTS model under 25MB
A Family of Open Sourced Music Foundation Models
Reference PyTorch implementation and models for DINOv3
Qwen2.5-VL is the multimodal large language model series
Lets make video diffusion practical
The official repo of Qwen chat & pretrained large language model
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Proxy that exposes Antigravity provided claude / gemini models
A Systematic Framework for Interactive World Modeling
Industrial-level controllable zero-shot text-to-speech system
DeepSeek Coder: Let the Code Write Itself
LTX-Video Support for ComfyUI
An experimental version of DeepSeek model
Advancing Open-source World Models
From Images to High-Fidelity 3D Assets
Qwen3.6 is the large language model series developed by Qwen team
ChatGLM-6B: An Open Bilingual Dialogue Language Model
State of the art LLM and coding model