Code for running inference with the SAM 3D Body Model 3DB
Tooling for the Common Objects In 3D dataset
Uncommon Objects in 3D dataset
Implementation of a U-net complete with efficient attention
Implementation of Make-A-Video, new SOTA text to video generator
Doom-based AI research platform for reinforcement learning
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
C++ library for image acquisition and visualization
Code that accompanies my blog post outlining five video classification
Cross Audio-Visual Recognition using 3D Architectures