A Systematic Framework for Interactive World Modeling
Models for object and human mesh reconstruction
RGBD video generation model conditioned on camera input
Generating Immersive, Explorable, and Interactive 3D Worlds
Tooling for the Common Objects In 3D dataset
Fast and universal 3D reconstruction model for versatile tasks
Benchmarking Multimodal Agents for Open-Ended Tasks
Code for Mesh R-CNN, ICCV 2019
Generate Any 3D Scene in Seconds
Simple and easily configurable grid world environments
The data structure for multimodal data
DeepMind model for tracking arbitrary points across videos and robotics
Graph Neural Network Library for PyTorch