Token-Oriented Object Notation (TOON)
NVR with realtime local object detection for IP cameras
Models for object and human mesh reconstruction
Pluggable SOTA multi-object tracking modules for segmentation
Code release for Cut and Learn for Unsupervised Object Detection
Video Object and Interaction Deletion
RF-DETR is a real-time object detection and segmentation
NVR with realtime local object detection for IP cameras
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Gracefully face hCaptcha challenge with multimodal llms
Provides code for running inference with the SegmentAnything Model
Ultralytics YOLO
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
GeoAI: Artificial Intelligence for Geospatial Data
Interactive Machine Learning experiments
Open source demo platform where you can easily showcase your AI models
Visual intelligence for your home.
Python library and CLI tool to interface with Google Translate
Recovering the Visual Space from Any Views
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
AI memory OS for LLM and Agent systems
Declarative way to run AI models in React Native on device
Build Vision Agents quickly with any model or video provider
Cross-platform, customizable ML solutions
Official implementation of DreamCraft3D