Open Source Differentiable Computer Vision Library
Build cross-modal and multimodal applications on the cloud
Implementation of a U-Net complete with efficient attention
Implementation of Make-A-Video, a new SOTA text-to-video generator
Implementation of Video Diffusion Models
Generate 3D objects conditioned on text or images
Framework that is dedicated to making neural data processing
CLIP + FFT/DWT/RGB = text-to-image/video
Visual localization made easy with hloc
Official Python Implementation for "3D Multi-Object Tracking
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
High-Resolution 3D Human Digitization from A Single Image
A library for graph deep learning research
A walk along memory lane
Point cloud diffusion for 3D model synthesis
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Toolkit for developing and comparing reinforcement learning algorithms
Implementation of BEVFormer, a camera-only framework
A collection of high-quality models for the MuJoCo physics engine
pyntcloud is a Python library for working with 3D point clouds
Notebooks, models and techniques for the generation of AI Art
3D-aware GANs based on NeRF (arXiv)
A version of the AI art creation software based on Disco Diffusion
Convolutional Neural Network for 3D meshes in PyTorch
PyTorch framework for doing deep learning on point clouds