Qwen-Image is a powerful image generation foundation model
Foundation model for image generation
EPUB to audiobook converter, optimized for Audiobookshelf
General-purpose image editing model that delivers high-fidelity
Focus on prompting and generating
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
A Powerful Native Multimodal Model for Image Generation
OCRmyPDF adds an OCR text layer to scanned PDF files
Official inference repo for FLUX.2 models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Official inference repo for FLUX.1 models
CLIP, Predict the most relevant text snippet given an image
ComfyUI wrapper nodes for HunyuanVideo
Comprehensive Markdown plugin built for Django
Stable Diffusion web UI
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
A Unified Framework for Text-to-3D and Image-to-3D Generation
Label Studio is a multi-type data labeling and annotation tool
Official MiniMax Model Context Protocol (MCP) server
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Collection of Gemma 3 variants that are trained for performance
Stable Diffusion built-in to Blender
Implementation of Imagen, Google's Text-to-Image Neural Network
Text and image to video generation: CogVideoX and CogVideo