Code for running inference with the SAM 3D Body Model 3DB
Models for object and human mesh reconstruction
High-Fidelity and Controllable Generation of Textured 3D Assets
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Native and Compact Structured Latents for 3D Generation
From Images to High-Fidelity 3D Assets
Tool for exploring and debugging transformer model behaviors
code for Mesh R-CNN, ICCV 2019
RGBD video generation model conditioned on camera input
Fast and Universal 3D reconstruction model for versatile tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
A Multi-Modal World Model for Reconstructing, Generating, Simulation
HY-Motion model for 3D character animation generation
Generating Immersive, Explorable, and Interactive 3D Worlds
Official implementation of DreamCraft3D
Sharp Monocular Metric Depth in Less Than a Second
Recovering the Visual Space from Any Views
Project Lyra: Open Generative 3D World Models
Generate Any 3D Scene in Seconds
State-of-the-art (SoTA) text-to-video pre-trained model
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
DeepMind model for tracking arbitrary points across videos & robotics
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
A collection of high-quality models for the MuJoCo physics engine