Stable Virtual Camera: Generative View Synthesis with Diffusion Models
NVR with realtime local object detection for IP cameras
Visual intelligence for your home.
RGBD video generation model conditioned on camera input
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Fast and Universal 3D reconstruction model for versatile tasks
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Open source driver assistance system
Official Python inference and LoRA trainer package
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Diffusion Transformer with Fine-Grained Chinese Understanding
Generate Any 3D Scene in Seconds
Sharp Monocular Metric Depth in Less Than a Second
Uncommon Objects in 3D dataset
Lightweight Python library for adding real-time multi-object tracking
Advancing Open-source World Models
A Unified Framework for Image Customization
Tooling for the Common Objects In 3D dataset
[CVPR 2025 Best Paper Award] VGGT
Open Source Differentiable Computer Vision Library
An efficient forwarding service designed for LLMs
Database system for building simpler and faster AI-powered application
High-Resolution 3D Human Digitization from A Single Image
Real-time face swap for PC streaming or video calls