Image generation model with single-stream diffusion transformer
Qwen3-TTS is an open-source series of TTS models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Official inference repo for FLUX.2 models
Qwen3-Coder is the code version of Qwen3
Text and image to video generation: CogVideoX and CogVideo
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
A Family of Open Sourced Music Foundation Models
Open-source multi-speaker long-form text-to-speech model
A theoretical reconstruction of the Claude Mythos architecture
A multimodal model for brain response prediction
Reference PyTorch implementation and models for DINOv3
Models for object and human mesh reconstruction
Qwen3.5 is the large language model series developed by Qwen team
Qwen3 is the large language model series developed by Qwen team
Python bindings for llama.cpp
LTX-Video Support for ComfyUI
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
High-Resolution Image Synthesis with Latent Diffusion Models
Lets make video diffusion practical
State-of-the-art TTS model under 25MB
Foundation model for image generation
Proxy that exposes Antigravity provided claude / gemini models
The official repo of Qwen chat & pretrained large language model