Qwen-Image is a powerful image generation foundation model
Image polygonal annotation with Python
The largest collection of PyTorch image encoders / backbones
Real time face swap and one-click video deepfake
Provides line-oriented text file editing capabilities
Focus on prompting and generating
NLP Cloud serves high performance pre-trained or custom models for NER
Web interface for generating images using Stable Diffusion models
Stable Diffusion web UI
OCRmyPDF adds an OCR text layer to scanned PDF files
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
A Powerful Native Multimodal Model for Image Generation
Stable Diffusion built-in to Blender
Official DeiT repository
Stable Diffusion with Core ML on Apple Silicon
Label Studio is a multi-type data labeling and annotation tool
Easily turn large sets of image urls to an image dataset
InvokeAI is a leading creative engine for Stable Diffusion models
Contexts Optical Compression
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Chat & pretrained large vision language model
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful and modular diffusion model GUI, api and backend
Code for running inference with the SAM 3D Body Model 3DB