Wan2.1: Open and Advanced Large-Scale Video Generative Model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Text- and image-to-video generation: CogVideoX and CogVideo
Multimodal-Driven Architecture for Customized Video Generation
Implementation of Phenaki Video, which uses MaskGIT
A Customizable Image-to-Video Model based on HunyuanVideo
RGBD video generation model conditioned on camera input
Implementation of Make-A-Video, a new SOTA text-to-video generator
Implementation of Video Diffusion Models
Overcoming Data Limitations for High-Quality Video Diffusion Models
Implementation of Recurrent Interface Network (RIN)
Multimodal AI Storyteller, built with Stable Diffusion, GPT, etc.
A walk along memory lane
The leading software for creating deepfakes