Qwen-Image is a powerful image generation foundation model
General-purpose image editing model that delivers high-fidelity
Modular AI image and video generation web UI with extensible tools
Stable Diffusion web UI
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
AI video generator optimized for low VRAM and older GPUs use
Stable Diffusion web UI
Text and image to video generation: CogVideoX and CogVideo
Official inference repo for FLUX.1 models
A Unified Framework for Text-to-3D and Image-to-3D Generation
Reverse engineering Gemini's SynthID detection
The most powerful and modular diffusion model GUI, api and backend
InvokeAI is a leading creative engine for Stable Diffusion models
All-in-one WebUI for AI generative image and video creation
Flexible Photo Recrafting While Preserving Your Identity
Recovering the Visual Space from Any Views
Native and Compact Structured Latents for 3D Generation
Unsupervised Learning for Image Registration
A SOTA open-source image editing model
Run a full local LLM stack with one command using Docker
Towards Real-World Vision-Language Understanding
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Official MiniMax Model Context Protocol (MCP) server
GeoAI: Artificial Intelligence for Geospatial Data
Generate high-definition story short videos with one click using AI