ShortGPT is an experimental AI-powered framework for automating the creation of short-form and long-form video content. It provides a structured system that covers multiple stages of the content creation workflow, including script generation, asset sourcing, voiceover synthesis, and video editing. Large language models generate the scripts and prompts that guide the automated editing and production process. Specialized content engines manage distinct workflows, such as generating short videos, producing longer videos, and translating existing videos into other languages. ShortGPT can assemble videos automatically by combining generated scripts, sourced media assets, captions, and synthesized voice narration. A modular editing system based on structured markup and JSON breaks editing into small, manageable steps that language models can interpret.
Features
- Automated pipeline for generating and editing short-form video content
- AI-driven script generation using large language models
- Automatic sourcing of background images and video footage from media APIs
- Voiceover generation using multiple text-to-speech engines with multilingual support
- Structured editing framework using JSON-based editing workflows
- Video translation and dubbing system that can transcribe, translate, and revoice videos
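
To make the idea of JSON-based editing steps more concrete, here is a minimal sketch of how a list of editing steps might be parsed, validated, and ordered into a timeline. It is an illustration only and does not reproduce ShortGPT's actual schema or API: the step types (`background_video`, `voiceover`, `caption`), the field names (`start`, `end`, `source`, `text`), and the helpers `parse_edit_plan` and `total_duration` are all hypothetical names chosen for this example.

```python
import json
from dataclasses import dataclass

# Hypothetical edit plan. The step types and field names are assumptions
# made for illustration; they are not ShortGPT's real editing format.
EDIT_PLAN = """
[
  {"type": "caption", "text": "Did you know?", "start": 0.5, "end": 3.0},
  {"type": "background_video", "source": "clip_001.mp4", "start": 0.0, "end": 12.0},
  {"type": "voiceover", "source": "narration.wav", "start": 0.0, "end": 12.0}
]
"""

# Step types this sketch accepts (assumed, not taken from the project).
ALLOWED_TYPES = {"background_video", "voiceover", "caption"}


@dataclass
class EditStep:
    type: str
    start: float   # seconds from the beginning of the video
    end: float     # seconds from the beginning of the video
    payload: dict  # remaining step-specific fields (source, text, ...)


def parse_edit_plan(raw: str) -> list[EditStep]:
    """Parse a JSON list of steps, validate each one, and sort by start time."""
    steps = []
    for index, item in enumerate(json.loads(raw)):
        fields = dict(item)  # copy so the parsed input is not mutated
        kind = fields.pop("type", None)
        if kind not in ALLOWED_TYPES:
            raise ValueError(f"step {index}: unknown type {kind!r}")
        try:
            start = float(fields.pop("start"))
            end = float(fields.pop("end"))
        except KeyError as missing:
            raise ValueError(f"step {index}: missing field {missing}") from None
        if end <= start:
            raise ValueError(f"step {index}: end must be greater than start")
        steps.append(EditStep(kind, start, end, fields))
    # Order by start time, then by type name so the result is deterministic.
    return sorted(steps, key=lambda step: (step.start, step.type))


def total_duration(steps: list[EditStep]) -> float:
    """Return the latest end time across all steps, or 0.0 for an empty plan."""
    return max((step.end for step in steps), default=0.0)


if __name__ == "__main__":
    for step in parse_edit_plan(EDIT_PLAN):
        print(f"{step.start:>5.1f}s to {step.end:>5.1f}s  {step.type}  {step.payload}")
```

Keeping each step as a small, self-describing JSON object is what makes this kind of plan easy for a language model to produce and for a program to check before any rendering happens. A renderer would then walk the sorted steps and apply each one in turn.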