AI tool that removes hardcoded subtitles and text from videos locally
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Implementation of Video Diffusion Models
AI Image Upscaler & Enhancer
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A python tool that uses GPT-4, FFmpeg, and OpenCV
Open source multimodal creative AI assistant with infinite canvas tool
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Image/video AI upscaler app (BSRGAN)
Text and image to video generation: CogVideoX and CogVideo
Implementation of Phenaki Video, which uses Mask GIT
AI framework for automated short video creation and editing tools
Generate short videos with one click using AI LLM
Industry leading face manipulation platform
Code for running inference and finetuning with SAM 3 model
This Python library makes it easy to display images and videos
RGBD video generation model conditioned on camera input
Wan2.2: Open and Advanced Large-Scale Video Generative Model
AI based photo editing website for changing image background
NSFW Windows app to batch download images and videos
Follow along with my AI Agents Masterclass videos
AI-powered video clipping and highlight generation
Multimodal-Driven Architecture for Customized Video Generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
VGGSfM: Visual Geometry Grounded Deep Structure From Motion