Code for Mesh R-CNN, ICCV 2019
A PyTorch library for implementing flow matching algorithms
Recovering the Visual Space from Any Views
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Native and Compact Structured Latents for 3D Generation
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Models for object and human mesh reconstruction
DeepMind model for tracking arbitrary points across videos, with applications in robotics
High-Fidelity and Controllable Generation of Textured 3D Assets
Text-to-3D, Image-to-3D, and Mesh Export with NeRF + Diffusion
Multimodal Transformer for document image understanding and layout