Implementation of DeepLabCut
End-to-end pipeline converting generative videos
Code for running inference with the SAM 3D Body Model 3DB
[CVPR 2025 Best Paper Award] VGGT
A lightweight 3D Morphable Face Model library in modern C++
Models for object and human mesh reconstruction
Build Vision Agents quickly with any model or video provider
Diffusion Transformer with Fine-Grained Chinese Understanding
Advancing Open-source World Models
C++ and Python Examples
AI-data warehouse to enrich, transform and analyze unstructured data
RGBD video generation model conditioned on camera input
Qwen-Image is a powerful image generation foundation model
Code to accompany "A Method for Animating Children's Drawings"
Public opinion analysis system
Personalize Any Characters with a Scalable Diffusion Transformer
Pre-trained Deep Learning models and demos
Face recognition with deep neural networks
Fast image augmentation library and an easy-to-use wrapper
A CNN model that predicts human joints from RGB images of a person
Let us control diffusion models
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
A real-time approach for mapping all human pixels of 2D RGB images
A dataset of short, object-centric video clips
Gluon CV Toolkit