HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Local-Global Fusion Network for Video Super-Resolution

{Xinyi Peng Xianfang Sun Longcun Jin Hua Wang Dewei Su}

Abstract

The goal of video super-resolution technique is to address the problem of effectively restoring high-resolution (HR) videos from low-resolution (LR) ones. Previous methods commonly used optical flow to perform frame alignment and designed a framework from the perspective of space and time. However, inaccurate optical flow estimation may occur easily which leads to inferior restoration effects. In addition, how to effectively fuse the features of various video frames remains a challenging problem. In this paper, we propose a Local-Global Fusion Network (LGFN) to solve the above issues from a novel viewpoint. As an alternative to optical flow, deformable convolutions (DCs) with decreased multi-dilation convolution units (DMDCUs) are applied for efficient implicit alignment. Moreover, a structure with two branches, consisting of a Local Fusion Module (LFM) and a Global Fusion Module (GFM), is proposed to combine information from two different aspects. Specifically, LFM focuses on the relationship between adjacent frames and maintains the temporal consistency while GFM attempts to take advantage of all related features globally with a video shuffle strategy. Benefiting from our advanced network, experimental results on several datasets demonstrate that our LGFN can not only achieve comparative performance with state-of-the-art methods but also possess reliable ability on restoring a variety of video frames. The results on benchmark datasets of our LGFN are presented on https://github.com/BIOINSu/LGFN and the source code will be released as soon as the paper is accepted.

Benchmarks

BenchmarkMethodologyMetrics
video-super-resolution-on-msu-super-1LGFN + aomenc
BSQ-rate over ERQA: 14.631
BSQ-rate over LPIPS: 5.536
BSQ-rate over MS-SSIM: 4.321
BSQ-rate over PSNR: 9.79
BSQ-rate over VMAF: 1.99
video-super-resolution-on-msu-super-1LGFN + x264
BSQ-rate over ERQA: 1.704
BSQ-rate over LPIPS: 1.324
BSQ-rate over MS-SSIM: 0.77
BSQ-rate over PSNR: 1.151
BSQ-rate over VMAF: 0.744
video-super-resolution-on-msu-super-1LGFN + vvenc
BSQ-rate over ERQA: 18.342
BSQ-rate over LPIPS: 11.759
BSQ-rate over MS-SSIM: 0.889
BSQ-rate over PSNR: 5.768
BSQ-rate over Subjective Score: 2.944
BSQ-rate over VMAF: 1.626
video-super-resolution-on-msu-super-1LGFN + x265
BSQ-rate over ERQA: 13.213
BSQ-rate over LPIPS: 11.399
BSQ-rate over MS-SSIM: 1.533
BSQ-rate over PSNR: 6.646
BSQ-rate over VMAF: 1.341
video-super-resolution-on-msu-super-1LGFN + uavs3e
BSQ-rate over ERQA: 9.279
BSQ-rate over LPIPS: 4.504
BSQ-rate over MS-SSIM: 2.427
BSQ-rate over PSNR: 5.503
BSQ-rate over VMAF: 1.625
video-super-resolution-on-msu-video-upscalersLGFN
PSNR: 27.42
SSIM: 0.939
VMAF: 57.79
video-super-resolution-on-msu-vsr-benchmarkLGFN
1 - LPIPS: 0.903
ERQAv1.0: 0.74
FPS: 0.667
PSNR: 31.291
QRCRv1.0: 0.629
SSIM: 0.898
Subjective score: 6.505

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Local-Global Fusion Network for Video Super-Resolution | Papers | HyperAI