Generating Immersive, Explorable, and Interactive 3D Worlds
Capable of understanding text, audio, vision, video
A Unified Framework for Image Customization
Label Studio is a multi-type data labeling and annotation tool
text and image to video generation: CogVideoX (2024) and CogVideo
Implementation of "MobileCLIP" CVPR 2024
21 Lessons, Get Started Building with Generative AI
Offline inference engine for art, real-time voice conversations
Convert AI papers to GUI
A lightweight vision library for performing large object detection
A desktop weather app powered by AI
Object detection architectures and models pretrained on the COCO data
Local image generation using VQGAN-CLIP or CLIP guided diffusion
Reproduces results of "Fixing the train-test resolution discrepancy"
We estimate dense, flicker-free, geometrically consistent depth
Composable GAN framework with api and user interface