A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Real-time face swap and one-click video deepfake
Image polygonal annotation with Python
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Text and image to video generation: CogVideoX and CogVideo
Implementation of Make-A-Video, a new SOTA text-to-video generator
Multimodal-Driven Architecture for Customized Video Generation
AI video generator optimized for low-VRAM and older GPUs
Official Python inference and LoRA trainer package
A Customizable Image-to-Video Model based on HunyuanVideo
Modular AI image and video generation web UI with extensible tools
Industry-leading face manipulation platform
All-in-one WebUI for AI generative image and video creation
Official MiniMax Model Context Protocol (MCP) server
Open-Sora: Democratizing Efficient Video Production for All
ComfyUI wrapper nodes for HunyuanVideo
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
RGBD video generation model conditioned on camera input
Generate high-definition story short videos with one click using AI
Director, Screenwriter, Producer, and Video Generator All-in-One
GPT4V-level open-source multi-modal model based on Llama3-8B
Capable of understanding text, audio, vision, and video
Motion-controllable Video Generation via Latent Trajectory Guidance
Implementation of Phenaki Video, which uses MaskGIT