HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer

Alik Pramanick Utsav Bheda Arijit Sur

ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer

Abstract

Recently, transformers have captured significant interest in the area of single-image super-resolution tasks, demonstrating substantial gains in performance. Current models heavily depend on the network's extensive ability to extract high-level semantic details from images while overlooking the effective utilization of multi-scale image details and intermediate information within the network. Furthermore, it has been observed that high-frequency areas in images present significant complexity for super-resolution compared to low-frequency areas. This work proposes a transformer-based super-resolution architecture called ML-CrAIST that addresses this gap by utilizing low-high frequency information in multiple scales. Unlike most of the previous work (either spatial or channel), we operate spatial and channel self-attention, which concurrently model pixel interaction from both spatial and channel dimensions, exploiting the inherent correlations across spatial and channel axis. Further, we devise a cross-attention block for super-resolution, which explores the correlations between low and high-frequency information. Quantitative and qualitative assessments indicate that our proposed ML-CrAIST surpasses state-of-the-art super-resolution methods (e.g., 0.15 dB gain @Manga109 $\times$4). Code is available on: https://github.com/Alik033/ML-CrAIST.

Code Repositories

alik033/ml-craist
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-super-resolution-on-2x-upscalingML-CrAIST
#params (K): 1259
FLOPs(G): 165.7
image-super-resolution-on-2x-upscalingML-CrAIST-Li
#params (K): 743
FLOPs(G): 97.2
image-super-resolution-on-3x-upscalingML-CrAIST
#params (K): 1268
FLOPs(G): 84.1
image-super-resolution-on-3x-upscalingML-CrAIST-Li
#params (K): 749
FLOPs(G): 49.6
image-super-resolution-on-4x-upscalingML-CrAIST-Li
#params (K): 758
FLOPs(G): 25.5
image-super-resolution-on-4x-upscalingML-CrAIST
#params (K): 1280
FLOPs(G): 42.9
image-super-resolution-on-b100-2x-upscalingML-CrAIST
SSIM: 0.9022
image-super-resolution-on-b100-2x-upscalingML-CrAIST-Li
PSNR: 32.36
SSIM: 0.902
image-super-resolution-on-b100-3x-upscalingML-CrAIST
SSIM: 0.8111
image-super-resolution-on-b100-3x-upscalingML-CrAIST-Li
PSNR: 29.28
SSIM: 0.8106
image-super-resolution-on-b100-4x-upscalingML-CrAIST
PSNR: 27.78
SSIM: 0.7446
image-super-resolution-on-b100-4x-upscalingML-CrAIST-Li
PSNR: 27.73
image-super-resolution-on-manga109-2xML-CrAIST
PSNR: 39.26
SSIM: 0.9786
image-super-resolution-on-manga109-2xML-CrAIST-Li
PSNR: 39.23
SSIM: 0.9785
image-super-resolution-on-manga109-3xML-CrAIST-Li
PSNR: 34.26
SSIM: 0.9492
image-super-resolution-on-manga109-3xML-CrAIST
PSNR: 34.42
SSIM: 0.9501
image-super-resolution-on-manga109-4xML-CrAIST
PSNR: 31.17
SSIM: 0.9176
image-super-resolution-on-manga109-4xML-CrAIST-Li
PSNR: 31.11
SSIM: 0.9162
image-super-resolution-on-set14-2x-upscalingML-CrAIST-Li
PSNR: 33.64
SSIM: 0.9213
image-super-resolution-on-set14-2x-upscalingML-CrAIST
PSNR: 33.77
SSIM: 0.922
image-super-resolution-on-set14-3x-upscalingML-CrAIST
PSNR: 30.39
SSIM: 0.8488
image-super-resolution-on-set14-3x-upscalingML-CrAIST-Li
PSNR: 30.23
SSIM: 0.8474
image-super-resolution-on-set14-4x-upscalingML-CrAIST-Li
PSNR: 28.4
SSIM: 0.7863
image-super-resolution-on-set14-4x-upscalingML-CrAIST
PSNR: 28.53
SSIM: 0.7895
image-super-resolution-on-set5-2x-upscalingML-CrAIST
PSNR: 38.19
SSIM: 0.9617
image-super-resolution-on-set5-2x-upscalingML-CrAIST-Li
PSNR: 38.15
SSIM: 0.9615
image-super-resolution-on-set5-3x-upscalingML-CrAIST
PSNR: 34.7
SSIM: 0.9302
image-super-resolution-on-set5-3x-upscalingML-CrAIST-Li
PSNR: 34.58
SSIM: 0.9294
image-super-resolution-on-set5-4x-upscalingML-CrAIST
PSNR: 32.36
SSIM: 0.8984
image-super-resolution-on-set5-4x-upscalingML-CrAIST-Li
PSNR: 32.15
SSIM: 0.8962
image-super-resolution-on-urban100-2xML-CrAIST-Li
PSNR: 32.93
SSIM: 0.9361
image-super-resolution-on-urban100-2xML-CrAIST
PSNR: 33.04
SSIM: 0.937
image-super-resolution-on-urban100-3xML-CrAIST-Li
PSNR: 28.73
SSIM: 0.8651
image-super-resolution-on-urban100-3xML-CrAIST
PSNR: 28.89
SSIM: 0.8676
image-super-resolution-on-urban100-4xML-CrAIST
PSNR: 26.68
SSIM: 0.8057
image-super-resolution-on-urban100-4xML-CrAIST-Li
PSNR: 26.53
SSIM: 0.8019

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp