Implementation of Make-A-Video, new SOTA text to video generator
Unsupervised Learning for Image Registration
DeepMind model for tracking arbitrary points across videos & robotics
A text-to-speech, speech-to-text and speech-to-speech library
A trainable PyTorch reproduction of AlphaFold 3
A collaboration friendly studio for NeRFs
Deep learning optimization library making distributed training easy
End-to-end pipeline converting generative videos
Synthetic data curation for post-training and data extraction
Framework for building neural networks
Implementation of Video Diffusion Models
Graph Neural Network Library for PyTorch
State-of-the-art diffusion models for image and audio generation
Deep learning optimization library: makes distributed training easy
Scientific Visualisation Made Easy
Generate 3D objects conditioned on text or images
Framework that is dedicated to making neural data processing
CLIP + FFT/DWT/RGB = text to image/video
Visual localization made easy with hloc
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
High-Resolution 3D Human Digitization from A Single Image
Point cloud diffusion for 3D model synthesis
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
GFPGAN aims at developing Practical Algorithms
A collection of high-quality models for the MuJoCo physics engine