Open image model at the forefront of design
Contexts Optical Compression
A Family of Open Sourced Music Foundation Models
OCR expert VLM powered by Hunyuan's native multimodal architecture
Renderer for the harmony response format to be used with gpt-oss
Accurate × Fast × Comprehensive
Qwen2.5-VL is the multimodal large language model series
Visual Causal Flow
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Block Diffusion for Ultra-Fast Speculative Decoding
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201