Video Object and Interaction Deletion
Open source demo platform where you can easily showcase your AI models
Effortless data labeling with AI support from Segment Anything
Label Studio is a multi-type data labeling and annotation tool
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Open source multimodal creative AI assistant with infinite canvas tool
Visual intelligence for your home.
Taming Stable Diffusion for Lip Sync
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Recovering the Visual Space from Any Views
LISA: Reasoning Segmentation via Large Language Model
Self-supervised visual learning using momentum contrast in PyTorch
PyTorch extensions for fast R&D prototyping and Kaggle farming
Generating Immersive, Explorable, and Interactive 3D Worlds
Gracefully face hCaptcha challenges with multimodal LLMs
General-purpose image editing model that delivers high-fidelity
Computer vision projects | Fun AI projects related to computer vision
CS2, Valorant, Fortnite, APEX, every game
Implementation of Nougat Neural Optical Understanding
Constantly summarizing open source datasets and critical papers
Visual tracking library based on PyTorch
Evaluates the performance of your neural net for object recognition
Aims to enable researchers to tap into mobile computing capability
Scripthea is designed to streamline the crafting of prompts for T2I generation.