Upscale-A-Video is a diffusion-based video super-resolution project from the CVPR 2024 Highlight paper “Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution.” It upscales low-resolution videos while using text prompts to guide the enhancement process. The model is designed for real-world videos where compression artifacts, blur, aging, or generated-video defects can make ordinary upscaling less reliable. The repository includes inference code, example inputs, configuration structure, pretrained model instructions, and optional LLaVA-assisted prompt support. It includes example workflows for AIGC videos, old videos, movies, and animations. It also provides color correction options to help reduce visual mismatch between the input and enhanced output.
Features
- Diffusion-based video upscaling
- Text-prompt-guided enhancement
- Temporal consistency focus
- AIGC and old video examples
- AdaIN and Wavelet color fixes
- Optional LLaVA prompt support