Visual tool for building, testing, and deploying AI agent workflows
Open source multimodal creative AI assistant with infinite canvas tool
A simple tool for reading in poorly redacted documents
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Skywork-R1V is an advanced multimodal AI model series
Book 5 of Statistics in Simplicity
AI Image Upscaler & Enhancer
Visual intelligence for your home.
Official Python inference and LoRA trainer package
Graph-based OSINT investigation platform with visual relationship mapping
Tiny vision language model
Official SeedVR2 Video Upscaler for ComfyUI
Turn WiFi signals into real-time human pose estimation and detection
RobotFramework support for Visual Studio Code
A state-of-the-art open visual language model
Data manipulation and transformation for audio signal processing
ASCII art library for Python
A framework to enable multimodal models to operate a computer
Edit videos with Claude Code
Parse files for optimal RAG
An extensive node suite that enables ComfyUI to process 3D inputs
LISA: Reasoning Segmentation via Large Language Model
VideoRAG: Chat with Your Videos
GPT Image 2 prompt gallery, image prompt library, agentic skill
Recovering the Visual Space from Any Views