Implementation of Video Diffusion Models
Implementation of Make-A-Video, a new SOTA text-to-video generator
Implementation of Phenaki Video, which uses MaskGIT
Implementation of a U-Net complete with efficient attention
Implementation of Recurrent Interface Network (RIN)
InvokeAI is a leading creative engine for Stable Diffusion models
Data Lake for Deep Learning. Build, manage, and query datasets
CLIP + FFT/DWT/RGB = text to image/video
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A walk along memory lane
Implementation of NÜWA, an attention network for text-to-video synthesis
Generate images from text, in Russian
A version of the AI art creation software, based on Disco Diffusion
Implementation of NWT, audio-to-video generation, in PyTorch
Source code for the CVPR 2019 paper "Deep Exemplar-based Colorization"
Software tool that converts text to video for a more engaging experience
DCVGAN: Depth Conditional Video Generation, ICIP 2019.