Build cross-modal and multimodal applications on the cloud
Generate 3D objects conditioned on text or images
Framework that is dedicated to making neural data processing
CLIP + FFT/DWT/RGB = text to image/video
2D and 3D Face alignment library build using pytorch
Visual localization made easy with hloc
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
High-Resolution 3D Human Digitization from A Single Image
A library for graph deep learning research
A walk along memory lane
Point cloud diffusion for 3D model synthesis
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Implementation of BEVFormer, a camera-only framework
Toolkit for developing and comparing reinforcement learning algorithms
A collection of high-quality models for the MuJoCo physics engine
pyntcloud is a Python library for working with 3D point clouds
Notebooks, models and techniques for the generation of AI Art
3D-aware GANs based on NeRF (arXiv)
Based on the Disco Diffusion, version of the AI art creation software
DeepImageTranslator: a deep-learning utility for image translation
Pytorch framework for doing deep learning on point clouds
A real-time approach for mapping all human pixels of 2D RGB images
A dataset of short, object-centric video clips
PyBullet Gymnasium environments for multi-agent reinforcement
Keras Temporal Convolutional Network