A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
AI tool that removes hardcoded subtitles and text from videos locally
State-of-the-art (SoTA) text-to-video pre-trained model
Implementation of Video Diffusion Models
Implementation of Make-A-Video, a new SOTA text-to-video generator
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Video-based AI memory library. Store millions of text chunks in MP4
Cut videos with a text editor
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Implementation of Phenaki Video, which uses MaskGIT
Open-Sora: Democratizing Efficient Video Production for All
Official Python inference and LoRA trainer package
Capable of understanding text, audio, vision, video
Multimodal-Driven Architecture for Customized Video Generation
Official MiniMax Model Context Protocol (MCP) server
Synchronized Translation for Videos
Generate blog articles from video or audio
Translate a video from one language to another and embed dubbing
Qwen3-Omni is a natively end-to-end, omni-modal LLM
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Build Vision Agents quickly with any model or video provider
Large Multimodal Models for Video Understanding and Editing
AI-powered tool for generating, optimizing, and translating subtitles
Code for running inference and finetuning with the SAM 3 model