Real time face swap and one-click video deepfake
iOS/Android image picker with support for camera, video, etc.
Image polygonal annotation with Python
Implementation of Make-A-Video, new SOTA text to video generator
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Text and image to video generation: CogVideoX and CogVideo
A Customizable Image-to-Video Model based on HunyuanVideo
Crafting engine for artists, designers, and filmmakers
AI video generator optimized for low VRAM and older GPUs use
Multimodal-Driven Architecture for Customized Video Generation
Official Python inference and LoRA trainer package
Modular AI image and video generation web UI with extensible tools
Industry leading face manipulation platform
ComfyUI wrapper nodes for HunyuanVideo
RGBD video generation model conditioned on camera input
Official MiniMax Model Context Protocol (MCP) server
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
All-in-one WebUI for AI generative image and video creation
Open-Sora: Democratizing Efficient Video Production for All
GPT4V-level open-source multi-modal model based on Llama3-8B
Implementation of Phenaki Video, which uses Mask GIT
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
ComfyUI wrapper nodes for WanVideo and related models
Generate high-definition story short videos with one click using AI