Generating Immersive, Explorable, and Interactive 3D Worlds
Capable of understanding text, audio, vision, video
Implementation of "MobileCLIP" CVPR 2024
Reproduces results of "Fixing the train-test resolution discrepancy"
CLIP model fine-tuned for zero-shot fashion product classification