Generating Immersive, Explorable, and Interactive 3D Worlds
A Unified Framework for Image Customization
Capable of understanding text, audio, vision, video
Reproduces results of "Fixing the train-test resolution discrepancy"
We estimate dense, flicker-free, geometrically consistent depth
Torch implementation of DeepMask and SharpMask
CLIP model fine-tuned for zero-shot fashion product classification