The goal of Space-Time Video Super-Resolution (STVSR) is to simultaneously increase the spatial resolution and frame rate of low-resolution, low-frame-rate video. Existing STVSR methods do not fully exploit the spatio-temporal correlation between successive video frames, which leaves frame reconstruction quality unsatisfactory, and large models suffer from slow inference. To address these problems, this paper proposes an STVSR method based on Multi-Scale Feature Interpolation and Temporal Feature Fusion (MSITF). First, feature interpolation is performed in the low-resolution feature space to obtain features corresponding to the missing frames. These features are then enhanced with deformable convolution to recover more accurate missing-frame features. Finally, a temporal feature fusion module performs temporal alignment and global context learning over the sequence of frame features, fully extracting and exploiting the useful spatio-temporal information in adjacent frames and thereby improving the quality of the reconstructed frames. Extensive experiments on the benchmark datasets Vid4 and Vimeo-90k show that the proposed method achieves better qualitative and quantitative performance: on Vid4, MSITF improves PSNR and SSIM by 0.8% and 1.9%, respectively, over the state-of-the-art two-stage method AdaCof+TTVSR, and by 1.2% and 2.5%, respectively, over the single-stage method RSTT, while reducing the number of parameters by 80.4% and 8.2% relative to AdaCof+TTVSR and RSTT, respectively.
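To make the three steps concrete, the following is a minimal sketch, assuming PyTorch and torchvision. The module name, channel sizes, and the simple blend and fusion layers are illustrative placeholders, not the authors' MSITF implementation; only the overall structure (interpolate in the low-resolution feature space, enhance with deformable convolution, then fuse temporally) follows the description above.

```python
# Hypothetical sketch of the abstract's pipeline -- NOT the authors' code.
import torch
import torch.nn as nn
from torchvision.ops import DeformConv2d

class MissingFrameSketch(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        # Step 1: feature interpolation -- blend the two neighboring
        # low-resolution frame features to synthesize the missing frame.
        self.blend = nn.Conv2d(2 * channels, channels, 3, padding=1)
        # Step 2: deformable-convolution enhancement -- offsets (2 values
        # per position of the 3x3 kernel) are predicted from the
        # interpolated feature itself.
        self.offset = nn.Conv2d(channels, 2 * 3 * 3, 3, padding=1)
        self.dcn = DeformConv2d(channels, channels, 3, padding=1)
        # Step 3: temporal fusion -- a lightweight stand-in for the
        # paper's temporal feature fusion module.
        self.fuse = nn.Conv2d(3 * channels, channels, 1)

    def forward(self, feat_prev, feat_next):
        interp = self.blend(torch.cat([feat_prev, feat_next], dim=1))
        enhanced = self.dcn(interp, self.offset(interp))
        return self.fuse(torch.cat([feat_prev, enhanced, feat_next], dim=1))

f0 = torch.randn(1, 64, 32, 32)   # feature of frame t
f1 = torch.randn(1, 64, 32, 32)   # feature of frame t+1
out = MissingFrameSketch()(f0, f1)
print(out.shape)  # torch.Size([1, 64, 32, 32])
```

In this sketch the alignment (deformable convolution) and fusion are single layers; the paper's temporal feature fusion module additionally performs global context learning across the whole frame sequence.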