RGBD video generation model conditioned on camera input
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Fast and Universal 3D reconstruction model for versatile tasks
Open source driver assistance system
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Sharp Monocular Metric Depth in Less Than a Second
Uncommon Objects in 3D dataset
Photorealistic Synthetic Dataset for Holistic Indoor Scene
NVR with realtime local object detection for IP cameras
Generate Any 3D Scene in Seconds
Advancing Open-source World Models
Lightweight Python library for adding real-time multi-object tracking
Tooling for the Common Objects In 3D dataset
Open Source Differentiable Computer Vision Library
[CVPR 2025 Best Paper Award] VGGT
Database system for building simpler and faster AI-powered application
High-Resolution 3D Human Digitization from A Single Image
Real-time face swap for PC streaming or video calls
A walk along memory lane
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Implementation of BEVFormer, a camera-only framework
Object detection architectures and models pretrained on the COCO data
A dataset of short, object-centric video clips
Deep learning gateway on Raspberry Pi and other edge devices