Open source libraries and APIs to build custom preprocessing pipelines
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
LLM framework for document understanding and semantic retrieval
A Unified Framework for Image Customization
Flexible Photo Recrafting While Preserving Your Identity
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
Open source framework for deep learning satellite and aerial imagery
Build AI-powered semantic search applications
Easily turn large sets of image URLs into an image dataset
Contexts Optical Compression
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Open source personal AI Assistant for Linux, Windows and Mac
We write your reusable computer vision tools
The repository provides code for running inference with SAM 2
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA"
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Algorithms for outlier, adversarial and drift detection
Reverse engineering Gemini's SynthID detection
An extensive node suite that enables ComfyUI to process 3D inputs
State-of-the-art diffusion models for image and audio generation
A neural network that transforms a design mock-up into static websites
High-Resolution Image Synthesis with Latent Diffusion Models
A Powerful Native Multimodal Model for Image Generation
Open-source image generative foundation model