Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Text- and image-to-video generation: CogVideoX and CogVideo
Multimodal-Driven Architecture for Customized Video Generation
Implementation of Phenaki Video, which uses MaskGIT
Official Python inference and LoRA trainer package
Open-Sora: Democratizing Efficient Video Production for All
Crafting engine for artists, designers, and filmmakers
Implementation of Video Diffusion Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Large Multimodal Models for Video Understanding and Editing
Implementation of Make-A-Video, a new SOTA text-to-video generator
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Text To Video Synthesis Colab
Overcoming Data Limitations for High-Quality Video Diffusion Models
Visual AI Workflow Builder
CLIP + FFT/DWT/RGB = text-to-image/video
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A walk along memory lane
Implementation of NÜWA, an attention network for text-to-video synthesis
Software tool that converts text to video for a more engaging experience