RobotFramework support for Visual Studio Code
Turn WiFi signals into real-time human pose estimation and detection
A framework to enable multimodal models to operate a computer
Graph-based OSINT investigation platform w visual relationship mapping
Visual tool for building, testing, and deploying AI agent workflows
A state-of-the-art open visual language model
Suite of reference architectures for building GPU-accelerated vision
An extensive node suite that enables ComfyUI to process 3D inputs
StarVector is a foundation model for SVG generation
FEATool Multiphysics is an easy-to-use FEA and CFD Simulation Toolbox
Official Python inference and LoRA trainer package
Machine Learning, Criticism and Correction
Machine learning image inpainting task that removes watermarks
Edit videos with Claude Code
Visual intelligence for your home.
Video Object and Interaction Deletion
A Grub Theme in the style of Minecraft!
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
VMZ: Model Zoo for Video Modeling
Book_4_Matrix Power | The Iris Book: From Addition, Subtraction
"VideoRAG: Chat with Your Videos
AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Weaving the Digital Agent Galaxy
Recovering the Visual Space from Any Views