DeepMind model for tracking arbitrary points across videos & robotics
Uncommon Objects in 3D dataset
Official implementation of DreamCraft3D
Code for Mesh R-CNN, ICCV 2019
Fast and universal 3D reconstruction model for versatile tasks
Pretrained time-series foundation model developed by Google Research
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen2.5-VL, a series of multimodal large language models
Tooling for the Common Objects In 3D dataset
RGBD video generation model conditioned on camera input
OpenAI's large-scale autoregressive pixel model for image generation
Elegant PyTorch implementation of the paper "Model-Agnostic Meta-Learning"
Code for the paper "Improved Techniques for Training GANs"