HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Visual Saliency Transformer

Nian Liu Ni Zhang Kaiyuan Wan Ling Shao Junwei Han

Visual Saliency Transformer

Abstract

Existing state-of-the-art saliency detection methods heavily rely on CNN-based architectures. Alternatively, we rethink this task from a convolution-free sequence-to-sequence perspective and predict saliency by modeling long-range dependencies, which can not be achieved by convolution. Specifically, we develop a novel unified model based on a pure transformer, namely, Visual Saliency Transformer (VST), for both RGB and RGB-D salient object detection (SOD). It takes image patches as inputs and leverages the transformer to propagate global contexts among image patches. Unlike conventional architectures used in Vision Transformer (ViT), we leverage multi-level token fusion and propose a new token upsampling method under the transformer framework to get high-resolution detection results. We also develop a token-based multi-task decoder to simultaneously perform saliency and boundary detection by introducing task-related tokens and a novel patch-task-attention mechanism. Experimental results show that our model outperforms existing methods on both RGB and RGB-D SOD benchmark datasets. Most importantly, our whole framework not only provides a new perspective for the SOD field but also shows a new paradigm for transformer-based dense prediction models. Code is available at https://github.com/nnizhang/VST.

Code Repositories

fhshen2022/prunerepaint
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
rgb-d-salient-object-detection-on-njudVST
S-Measure: 0.922
rgb-d-salient-object-detection-on-nlprVST
S-Measure: 0.932
rgb-d-salient-object-detection-on-sipVST
Average MAE: 0.040
S-Measure: 90.4
max E-Measure: 94.4
max F-Measure: 91.5
thermal-image-segmentation-on-rgb-t-glassVST
MAE: 0.044

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp