Wan2.2: Open and Advanced Large-Scale Video Generative Model
A python tool that uses GPT-4, FFmpeg, and OpenCV
Python inference and LoRA trainer package for the LTX-2 audio–video
Generate short videos with one click using AI LLM
Open-Sora: Democratizing Efficient Video Production for All
Wan2.1: Open and Advanced Large-Scale Video Generative Model
RGBD video generation model conditioned on camera input
LTX-Video Support for ComfyUI
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official repository for LTX-Video
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
AI-powered video clipping and highlight generation
Implementation of Video Diffusion Models
Multimodal-Driven Architecture for Customized Video Generation
Implementation of Phenaki Video, which uses Mask GIT
Large Multimodal Models for Video Understanding and Editing
A Customizable Image-to-Video Model based on HunyuanVideo
Implementation of Recurrent Interface Network (RIN)
Generate high-definition story short videos with one click using AI
Implementation of Make-A-Video, new SOTA text to video generator
A Customizable Image-to-Video Model based on HunyuanVideo
Overcoming Data Limitations for High-Quality Video Diffusion Models
CLIP + FFT/DWT/RGB = text to image/video
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A walk along memory lane