HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

DRCT: Saving Image Super-resolution away from Information Bottleneck

Chih-Chung Hsu Chia-Ming Lee Yi-Shiuan Chou

DRCT: Saving Image Super-resolution away from Information Bottleneck

Abstract

In recent years, Vision Transformer-based approaches for low-level vision tasks have achieved widespread success. Unlike CNN-based models, Transformers are more adept at capturing long-range dependencies, enabling the reconstruction of images utilizing non-local information. In the domain of super-resolution, Swin-transformer-based models have become mainstream due to their capability of global spatial information modeling and their shifting-window attention mechanism that facilitates the interchange of information between different windows. Many researchers have enhanced model performance by expanding the receptive fields or designing meticulous networks, yielding commendable results. However, we observed that it is a general phenomenon for the feature map intensity to be abruptly suppressed to small values towards the network's end. This implies an information bottleneck and a diminishment of spatial information, implicitly limiting the model's potential. To address this, we propose the Dense-residual-connected Transformer (DRCT), aimed at mitigating the loss of spatial information and stabilizing the information flow through dense-residual connections between layers, thereby unleashing the model's potential and saving the model away from information bottleneck. Experiment results indicate that our approach surpasses state-of-the-art methods on benchmark datasets and performs commendably at the NTIRE-2024 Image Super-Resolution (x4) Challenge. Our source code is available at https://github.com/ming053l/DRCT

Code Repositories

ming053l/drct
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-super-resolution-on-bsd100-2x-upscalingDRCT-L
PSNR: 32.90
SSIM: 0.9078
image-super-resolution-on-bsd100-2x-upscalingDRCT
PSNR: 32.75
SSIM: 0.9071
image-super-resolution-on-bsd100-4x-upscalingDRCT-L
PSNR: 28.16
SSIM: 0.7577
image-super-resolution-on-bsd100-4x-upscalingDRCT
PSNR: 28.06
SSIM: 0.7533
image-super-resolution-on-manga109-2xDRCT
PSNR: 40.41
SSIM: 0.9814
image-super-resolution-on-manga109-2xDRCT-L
PSNR: 41.14
SSIM: 0.9842
image-super-resolution-on-manga109-4xDRCT-L
PSNR: 33.14
SSIM: 0.9347
image-super-resolution-on-manga109-4xDRCT
PSNR: 32.96
SSIM: 0.9324
image-super-resolution-on-set14-2x-upscalingDRCT-L
PSNR: 35.36
SSIM: 0.9302
image-super-resolution-on-set14-2x-upscalingDRCT
PSNR: 34.96
SSIM: 0.9287
image-super-resolution-on-set14-4x-upscalingDRCT-L
PSNR: 29.54
SSIM: 0.8025
image-super-resolution-on-set14-4x-upscalingDRCT
PSNR: 29.40
SSIM: 0.8003
image-super-resolution-on-set5-2x-upscalingDRCT-L
PSNR: 39.14
SSIM: 0.9658
image-super-resolution-on-set5-2x-upscalingDRCT
PSNR: 38.72
SSIM: 0.9646
image-super-resolution-on-set5-3x-upscalingDRCT
PSNR: 35.18
SSIM: 0.9338
image-super-resolution-on-set5-3x-upscalingDRCT-L
PSNR: 35.32
SSIM: 0.9348
image-super-resolution-on-urban100-2xDRCT-L
PSNR: 35.17
SSIM: 0.9516
image-super-resolution-on-urban100-2xDRCT
PSNR: 34.54
SSIM: 0.9474
image-super-resolution-on-urban100-4xDRCT-L
PSNR: 28.70
SSIM: 0.8508
image-super-resolution-on-urban100-4xDRCT
PSNR: 28.40
SSIM: 0.8457

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp