Implementation of Video Diffusion Models
RGBD video generation model conditioned on camera input
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Tencent Hunyuan multimodal diffusion transformer (MM-DiT) model
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Implementation of Recurrent Interface Network (RIN)
Overcoming Data Limitations for High-Quality Video Diffusion Models
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
AI-powered tool for quickly removing watermarks from videos
CLIP + FFT/DWT/RGB = text to image/video
Implementation of NÜWA, an attention network for text-to-video synthesis