End-to-end pipeline converting generative videos
Implementation of DeepLabCut
Code for running inference with the SAM 3D Body Model 3DB
[CVPR 2025 Best Paper Award] VGGT
Models for object and human mesh reconstruction
Build Vision Agents quickly with any model or video provider
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Diffusion Transformer with Fine-Grained Chinese Understanding
Advancing Open-source World Models
AI-data warehouse to enrich, transform and analyze unstructured data
RGBD video generation model conditioned on camera input
Qwen-Image is a powerful image generation foundation model
Public opinion analysis system
Code to accompany "A Method for Animating Children's Drawings"
Personalize Any Characters with a Scalable Diffusion Transformer
Pre-trained Deep Learning models and demos
Fast image augmentation library and an easy-to-use wrapper
Automated Face Blurring, Kinematics Extraction and Leg dystonia Dx
Let us control diffusion models
Official Code for DragGAN (SIGGRAPH 2023)
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
A real-time approach for mapping all human pixels of 2D RGB images
A dataset of short, object-centric video clips
Gluon CV Toolkit
Efficient 3D human pose estimation in video using 2D keypoint