A state-of-the-art open visual language model
VMZ: Model Zoo for Video Modeling
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Chinese and English multimodal conversational language model
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Local AI file organization with image-based rename suggestions
We estimate dense, flicker-free, geometrically consistent depth
High-fidelity indoor 3D dataset for AI simulation and robotics