Qwen-Image-Edit is the image editing extension of Qwen-Image, a 20B parameter model that combines advanced visual and text-rendering capabilities for creative and precise editing. It leverages both Qwen2.5-VL for semantic control and a VAE Encoder for appearance control, enabling users to edit at both the content and detail level. The model excels at semantic edits like style transfer, object rotation, and novel view synthesis, while also handling precise appearance edits such as adding or removing elements without altering surrounding regions. A standout feature is its bilingual text editing in English and Chinese, which preserves original font, size, and style during modifications. Benchmarks confirm its state-of-the-art performance in image editing, establishing it as a reliable foundation for both artistic and practical tasks. Its applications span IP creation, meme generation, background changes, clothing edits, and fine corrections in artworks or calligraphy.
Features
- Built on the 20B Qwen-Image model, extended for editing tasks
- Dual control via Qwen2.5-VL (semantics) and VAE Encoder (appearance)
- Supports semantic edits (style transfer, IP creation, novel view synthesis)
- Handles precise appearance edits (element addition/removal, background changes)
- Bilingual (Chinese & English) text editing with font/style preservation
- State-of-the-art performance on multiple public image editing benchmarks
- Enables step-by-step, chained editing for fine-grained corrections
- Licensed under Apache 2.0 for open and flexible use