Sharp Monocular Metric Depth in Less Than a Second
Recovering the Visual Space from Any Views
Multimodal embedding and reranking models built on Qwen3-VL
A Unified Framework for Text-to-3D and Image-to-3D Generation
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Let us control diffusion models
Code for "Image Generation from Scene Graphs", Johnson et al., CVPR 2018