BasicSR
Winning Solution in NTIRE19 Challenges on Video Restoration
...Unlike single-image restoration models, EDVR addresses the temporal dimension by aligning multiple video frames using deformable convolutional layers in a coarse-to-fine manner, allowing it to effectively handle large motion and complex scene dynamics. The architecture includes bespoke modules (e.g., Pyramid, Cascading and Deformable alignment and Temporal Spatial Attention fusion) that align information across frames and emphasize important features for restoration, enabling state-of-the-art performance on benchmarks such as the REDS challenge. By fusing spatial and temporal context, EDVR produces significantly improved visual quality in restored videos compared with approaches that treat each frame independently.