VGGT-Ω
[CVPR 2026 Oral] VGGT Omega
VGGT-Omega is a Facebook Research computer vision project for feed-forward camera and depth reconstruction. It takes images as input and predicts camera parameters, depth maps, confidence values, and related scene tokens. The project targets 3D understanding workflows in which a model infers scene geometry directly from images, without a traditional multi-stage reconstruction pipeline. It ships pretrained model variants that differ in resolution and text-alignment capabilities, though checkpoint access may require approval. The repository also provides a Gradio demo that visualizes the predicted cameras and depth-unprojected point clouds as a GLB scene. ...
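To make "depth-unprojected point clouds" concrete, here is a minimal NumPy sketch of the standard pinhole unprojection that turns a depth map plus camera parameters into 3D points. It is not the repository's code: the function name `unproject_depth`, its parameters, the 3x3 intrinsics layout, the 4x4 camera-to-world convention, and the per-pixel confidence threshold are all assumptions chosen for illustration, and the project's actual output format may differ.

```python
import numpy as np


def unproject_depth(depth, K, cam_to_world=None, conf=None, min_conf=0.0):
    """Lift an (H, W) depth map to an (N, 3) point cloud.

    Hypothetical helper, not part of VGGT-Omega. Assumes a pinhole camera
    with intrinsics K = [[fx, 0, cx], [0, fy, cy], [0, 0, 1]] and depth
    measured along the camera z axis.
    """
    h, w = depth.shape
    # Pixel grid: u indexes columns (x), v indexes rows (y).
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    fx, fy = K[0, 0], K[1, 1]
    cx, cy = K[0, 2], K[1, 2]

    # Invert the pinhole projection u = fx * x / z + cx, v = fy * y / z + cy.
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    points = np.stack([x, y, depth], axis=-1).reshape(-1, 3)

    # Optionally move points from the camera frame to the world frame
    # (assumed convention: p_world = R @ p_cam + t).
    if cam_to_world is not None:
        R, t = cam_to_world[:3, :3], cam_to_world[:3, 3]
        points = points @ R.T + t

    # Optionally drop low-confidence pixels (assumed per-pixel confidence).
    if conf is not None:
        points = points[conf.reshape(-1) >= min_conf]
    return points


if __name__ == "__main__":
    depth = np.full((4, 6), 2.0)              # toy depth map, 2 units everywhere
    K = np.array([[100.0, 0.0, 3.0],
                  [0.0, 100.0, 2.0],
                  [0.0, 0.0, 1.0]])           # made-up intrinsics
    print(unproject_depth(depth, K).shape)
```

Points from several views can be concatenated after applying each view's camera-to-world transform, which is the general idea behind merging per-image depth predictions into a single scene for visualization.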