VGGSfM: Visual Geometry Grounded Deep Structure From Motion
An experimental version of the DeepSeek model
Python inference and LoRA trainer package for the LTX-2 audio–video
Code for running inference with the SAM 3D Body Model (3DB)
Generating Immersive, Explorable, and Interactive 3D Worlds
Video understanding codebase from FAIR for reproducing video models
Analyze computation-communication overlap in V3/R1
Code for Mesh R-CNN, ICCV 2019
Official PyTorch Implementation of "Scalable Diffusion Models"
A collection of high-quality models for the MuJoCo physics engine
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
Learning Continuous Signed Distance Functions for Shape Representation