Real-time face swap and one-click video deepfake
Image polygonal annotation with Python
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Text and image to video generation: CogVideoX and CogVideo
Implementation of Make-A-Video, a new SOTA text-to-video generator
Crafting engine for artists, designers, and filmmakers
Multimodal-Driven Architecture for Customized Video Generation
A Customizable Image-to-Video Model based on HunyuanVideo
AI video generator optimized for low-VRAM and older GPUs
Official Python inference and LoRA trainer package
Modular AI image and video generation web UI with extensible tools
Industry-leading face manipulation platform
All-in-one WebUI for AI generative image and video creation
Official MiniMax Model Context Protocol (MCP) server
Open-Sora: Democratizing Efficient Video Production for All
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
RGBD video generation model conditioned on camera input
Generate high-definition story short videos with one click using AI
Director, Screenwriter, Producer, and Video Generator All-in-One
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
GPT4V-level open-source multi-modal model based on Llama3-8B
Capable of understanding text, audio, vision, video
Motion-controllable Video Generation via Latent Trajectory Guidance
Implementation of Phenaki Video, which uses MaskGIT