FLUX.2 is a state-of-the-art open-weight image generation and editing model released by Black Forest Labs aimed at bridging the gap between research-grade capabilities and production-ready workflows. The model offers both text-to-image generation and powerful image editing, including editing of multiple reference images, with fidelity, consistency, and realism that push the limits of what open-source generative models have achieved. It supports high-resolution output (up to ~4 megapixels), which allows for photography-quality images, detailed product shots, infographics or UI mockups rather than just low-resolution drafts. FLUX.2 is built with a modern architecture (a flow-matching transformer + a revamped VAE + a strong vision-language encoder), enabling strong prompt adherence, correct rendering of text/typography in images, reliable lighting, layout, and physical realism, and consistent style/character/product identity across multiple generations or edits.
Features
- Open-weight, 32 billion-parameter image generation and editing model combining text-to-image and multi-reference editing
- Native output up to ~4 megapixels for high-resolution, production-quality images
- Strong prompt adherence, precise text rendering, and accurate layout/typography support for UI design and infographics
- Multi-reference editing: accept multiple input images (e.g., a style reference + lighting reference + subject reference) and produce consistent outputs with preserved identity/style
- Official inference code in Python, enabling local deployment (with sufficient GPU VRAM) or integration in custom pipelines
- Available via hosted API (“pro/flex”) as well as open dev weights — flexible licensing for research, creative, or commercial workflows