Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Implementation of Video Diffusion Models
Implementation of Make-A-Video, new SOTA text to video generator
Implementation of Phenaki Video, which uses Mask GIT
A walk along memory lane
Implementation of Recurrent Interface Network (RIN)
Implementation of NÜWA, attention network for text to video synthesis
CLIP + FFT/DWT/RGB = text to image/video
Implementation of NWT, audio-to-video generation, in Pytorch
Software tool that converts text to video for more engaging experience
DCVGAN: Depth Conditional Video Generation, ICIP 2019.