MapAnything: Universal Feed-Forward Metric 3D Reconstruction
DeepMind model for tracking arbitrary points across videos & robotics
End-to-end pipeline converting generative videos
Unsupervised Learning for Image Registration
Deep learning optimization library making distributed training easy
Framework for building neural networks
Graph Neural Network Library for PyTorch
A text-to-speech, speech-to-text and speech-to-speech library
State-of-the-art diffusion models for image and audio generation
Deep learning optimization library: makes distributed training easy
Implementation of Make-A-Video, new SOTA text to video generator
Implementation of Video Diffusion Models
Generate 3D objects conditioned on text or images
Framework that is dedicated to making neural data processing
CLIP + FFT/DWT/RGB = text to image/video
Visual localization made easy with hloc
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
High-Resolution 3D Human Digitization from A Single Image
Point cloud diffusion for 3D model synthesis
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
A collection of high-quality models for the MuJoCo physics engine
3D-aware GANs based on NeRF (arXiv)
Convolutional Neural Network for 3D meshes in PyTorch
Pytorch framework for doing deep learning on point clouds
A real-time approach for mapping all human pixels of 2D RGB images