Recovering the Visual Space from Any Views
Convert various image, audio and video formats from your context menu.
NVR with realtime local object detection for IP cameras
We write your reusable computer vision tools
Automatically translates the text of a video based on a subtitle file
Segmentation models with pretrained backbones. PyTorch
Easy to use Python library for creating 2D arcade games
Comprehensive tutorial repository aimed at teaching the Python program
Sharp Monocular Metric Depth in Less Than a Second
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Qwen3-omni is a natively end-to-end, omni-modal LLM
AI based photo editing website for changing image background
Code release for Cut and Learn for Unsupervised Object Detection
Implementation of Recurrent Interface Network (RIN)
Scientific Internet access
Free Motion Capture for Everyone
Mod manager for the video game RimWorld
Download videos/channels/playlists from YouTube and many other sites
A light version of Debian with minimal installed using LXDE.
This Python library makes it easy to display images and videos
Python implementation of global optimization with gaussian processes
OCR expert VLM powered by Hunyuan's native multimodal architecture
Label Studio is a multi-type data labeling and annotation tool
Turns the YT Music site into a desktop application.
Convert AI papers to GUI