Sharp Monocular Metric Depth in Less Than a Second
Convert AI papers to GUI
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
Spring AI Alibaba examples for building and testing AI apps
GeoAI: Artificial Intelligence for Geospatial Data
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Advanced AI Explainability for computer vision
Official implementation of DreamCraft3D
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An extensive node suite that enables ComfyUI to process 3D inputs
AI-data warehouse to enrich, transform and analyze unstructured data
Scientific Visualisation Made Easy
Document Image Parsing via Heterogeneous Anchor Prompting
Build cross-modal and multimodal applications on the cloud
Python SDK for the Computer Use model Lux, developed by OpenAGI
Large-language-model & vision-language-model based on Linear Attention
AI-powered tool to quickly remove watermarks from images flawlessly
Overcoming Data Limitations for High-Quality Video Diffusion Models
A Customizable Image-to-Video Model based on HunyuanVideo
AI Suite for upscaling, interpolating & restoring images/videos
An open source object detection toolbox based on PyTorch
OpenMMLab Model Deployment Framework
Computer vision projects | Fun AI projects related to computer vision
OpenMMLab Image Classification Toolbox and Benchmark
Visual localization made easy with hloc