Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Visual intelligence for your home.
RGBD video generation model conditioned on camera input
Fast and Universal 3D reconstruction model for versatile tasks
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Lightweight Python library for adding real-time multi-object tracking
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Official Python inference and LoRA trainer package
Sharp Monocular Metric Depth in Less Than a Second
NVR with realtime local object detection for IP cameras
Generate Any 3D Scene in Seconds
Advancing Open-source World Models
Open Source Differentiable Computer Vision Library
[CVPR 2025 Best Paper Award] VGGT
Database system for building simpler and faster AI-powered application
High-Resolution 3D Human Digitization from A Single Image
Real-time face swap for PC streaming or video calls
A walk along memory lane
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Implementation of BEVFormer, a camera-only framework
Object detection architectures and models pretrained on the COCO data
Face Recognition based Attendance System for school, college...
A dataset of short, object-centric video clips
Deep learning gateway on Raspberry Pi and other edge devices
Foot traffic and facial analytics for your business and home