Structure-from-Motion and Multi-View Stereo
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Visual localization made easy with hloc
2D and 3D Face alignment library build using pytorch
Implementation of Make-A-Video, new SOTA text to video generator
Implementation of a U-net complete with efficient attention
A walk along memory lane
State-of-the-art diffusion models for image and audio generation
CLIP + FFT/DWT/RGB = text to image/video
The data structure for multimodal data
Build cross-modal and multimodal applications on the cloud
Geometric deep learning extension library for PyTorch
Notebooks, models and techniques for the generation of AI Art
Based on the Disco Diffusion, version of the AI art creation software
Web labeling tool for bitmap images and point clouds
Real-time multi-person keypoint detection library for body, face, etc.
An open-source convolutional neural networks platform for research
C++ library for image acquisition and visualization
JavaScript helpers for rendering high-resolution image variants
Automatic segmentation and tracking for 3D time-lapse microscopy
Image-based Vascular Analysis Toolkit