An extensive node suite that enables ComfyUI to process 3D inputs
Generating Immersive, Explorable, and Interactive 3D Worlds
Recovering the Visual Space from Any Views
PyTorch3D is FAIR's library of reusable components for deep learning
Unifying 3D Mesh Generation with Language Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Benchmarking Multimodal Agents for Open-Ended Tasks
SAPIEN Manipulation Skill Framework
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Python inference and LoRA trainer package for the LTX-2 audio–video
End-to-end pipeline converting generative videos
PaddlePaddle End-to-End Development Toolkit
Modular quant framework
Visual localization made easy with hloc
A library for graph deep learning research
Implementation of BEVFormer, a camera-only framework
Machine learning glossary
A real-time approach for mapping all human pixels of 2D RGB images
Cross Audio-Visual Recognition using 3D Architectures