RGBD video generation model conditioned on camera input
Visual intelligence for your home.
Fast and Universal 3D reconstruction model for versatile tasks
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Sharp Monocular Metric Depth in Less Than a Second
Official Python inference and LoRA trainer package
NVR with realtime local object detection for IP cameras
Generate Any 3D Scene in Seconds
[CVPR 2025 Best Paper Award] VGGT
Lightweight Python library for adding real-time multi-object tracking
Advancing Open-source World Models
Tooling for the Common Objects In 3D dataset
Database system for building simpler and faster AI-powered application
High-Resolution 3D Human Digitization from A Single Image
Real-time face swap for PC streaming or video calls
A walk along memory lane
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Implementation of BEVFormer, a camera-only framework
A dataset of short, object-centric video clips
Deep learning gateway on Raspberry Pi and other edge devices
Hide screen when boss is approaching
World's simplest facial recognition api for Python & the command line