HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection

Chenxi Xie; Changqun Xia; Mingcan Ma; Zhirui Zhao; Xiaowu Chen; Jia Li

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection

Abstract

Recent salient object detection (SOD) methods based on deep neural network have achieved remarkable performance. However, most of existing SOD models designed for low-resolution input perform poorly on high-resolution images due to the contradiction between the sampling depth and the receptive field size. Aiming at resolving this contradiction, we propose a novel one-stage framework called Pyramid Grafting Network (PGNet), using transformer and CNN backbone to extract features from different resolution images independently and then graft the features from transformer branch to CNN branch. An attention-based Cross-Model Grafting Module (CMGM) is proposed to enable CNN branch to combine broken detailed information more holistically, guided by different source feature during decoding process. Moreover, we design an Attention Guided Loss (AGL) to explicitly supervise the attention matrix generated by CMGM to help the network better interact with the attention from different models. We contribute a new Ultra-High-Resolution Saliency Detection dataset UHRSD, containing 5,920 images at 4K-8K resolutions. To our knowledge, it is the largest dataset in both quantity and resolution for high-resolution SOD task, which can be used for training and testing in future research. Sufficient experiments on UHRSD and widely-used SOD datasets demonstrate that our method achieves superior performance compared to the state-of-the-art methods.

Code Repositories

icvteam/pgnet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
rgb-salient-object-detection-on-davis-sPGNet
F-measure: 0.931
MAE: 0.015
S-measure: 0.935
mBA: 0.707
rgb-salient-object-detection-on-davis-sPGNet (DUTS, HRSOD)
F-measure: 0.948
MAE: 0.012
S-measure: 0.947
mBA: 0.716
rgb-salient-object-detection-on-davis-sPGNet (HRSOD, UHRSD)
F-measure: 0.956
MAE: 0.010
S-measure: 0.954
mBA: 0.730
rgb-salient-object-detection-on-hrsodPGNet (DUTS, HRSOD)
MAE: 0.020
S-Measure: 0.935
mBA: 0.714
max F-Measure: 0.929
rgb-salient-object-detection-on-hrsodPGNet
MAE: 0.021
S-Measure: 0.930
mBA: 0.693
max F-Measure: 0.922
rgb-salient-object-detection-on-hrsodPGNet (HRSOD, UHRSD)
MAE: 0.020
S-Measure: 0.938
mBA: 0.727
max F-Measure: 0.939
rgb-salient-object-detection-on-uhrsdPGNet (DUTS, HRSOD)
MAE: 0.036
S-Measure: 0.912
mBA: 0.735
max F-Measure: 0.915
rgb-salient-object-detection-on-uhrsdPGNet (HRSOD, UHRSD)
MAE: 0.026
S-Measure: 0.935
mBA: 0.765
max F-Measure: 0.930
rgb-salient-object-detection-on-uhrsdPGNet
MAE: 0.037
S-Measure: 0.912
mBA: 0.715
max F-Measure: 0.914

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp