Recovering the Visual Space from Any Views
Comprehensive tutorial repository aimed at teaching the Python program
Mod manager for the video game RimWorld
Qwen3-omni is a natively end-to-end, omni-modal LLM
Segmentation models with pretrained backbones. PyTorch
We write your reusable computer vision tools
AI based photo editing website for changing image background
Automatically translates the text of a video based on a subtitle file
Public opinion analysis system
Smart video player and playlist manager.
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Download videos/channels/playlists from YouTube and many other sites
Python implementation of global optimization with gaussian processes
Qwen2.5-VL is the multimodal large language model series
Convert AI papers to GUI
Scientific Internet access
Implementation of Recurrent Interface Network (RIN)
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Multimodal embedding and reranking models built on Qwen3-VL
Code release for Cut and Learn for Unsupervised Object Detection
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop
This Python library makes it easy to display images and videos
This is a background removing tool powered by InSPyReNet
Fantasy Premier League MCP Server