Open source demo platform where you can easily showcase your AI models
Video Object and Interaction Deletion
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Visual intelligence for your home.
Recovering the Visual Space from Any Views
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
LISA: Reasoning Segmentation via Large Language Model
Interactive Machine Learning experiments
Gracefully face hCaptcha challenges with multimodal LLMs
General-purpose image editing model that delivers high-fidelity
Framework & GUI for Bayes Nets and other probabilistic models.
computer vision projects | Fun AI projects related to computer vision
CS2, Valorant, Fortnite, APEX, every game
simple algorithm for a realtime interactive visual cortex for painting
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Advanced bilingual image editing with semantic control