HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Explicit Visual Prompting for Universal Foreground Segmentations

Liu Weihuang ; Shen Xi ; Pun Chi-Man ; Cun Xiaodong

Explicit Visual Prompting for Universal Foreground Segmentations

Abstract

Foreground segmentation is a fundamental problem in computer vision, whichincludes salient object detection, forgery detection, defocus blur detection,shadow detection, and camouflage object detection. Previous works havetypically relied on domain-specific solutions to address accuracy androbustness issues in those applications. In this paper, we present a unifiedframework for a number of foreground segmentation tasks without anytask-specific designs. We take inspiration from the widely-used pre-trainingand then prompt tuning protocols in NLP and propose a new visual promptingmodel, named Explicit Visual Prompting (EVP). Different from the previousvisual prompting which is typically a dataset-level implicit embedding, our keyinsight is to enforce the tunable parameters focusing on the explicit visualcontent from each individual image, i.e., the features from frozen patchembeddings and high-frequency components. Our method freezes a pre-trainedmodel and then learns task-specific knowledge using a few extra parameters.Despite introducing only a small number of tunable parameters, EVP achievessuperior performance than full fine-tuning and other parameter-efficientfine-tuning methods. Experiments in fourteen datasets across five tasks showthe proposed method outperforms other task-specific methods while beingconsiderably simple. The proposed method demonstrates the scalability indifferent architectures, pre-trained weights, and tasks. The code is availableat: https://github.com/NiFangBaAGe/Explicit-Visual-Prompt.

Code Repositories

nifangbaage/explicit-visual-prompt
Official
pytorch
Mentioned in GitHub
nifangbaage/explict-visual-prompt
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
camouflaged-object-segmentation-on-camoEVPv2
MAE: 0.058
S-Measure: 0.848
Weighted F-Measure: 0.786
camouflaged-object-segmentation-on-codEVPv2
MAE: 0.029
S-Measure: 0.843
Weighted F-Measure: 0.746
salient-object-detection-on-dut-omron-2EVPv2
E-measure: 0.895
MAE: 0.047
S-measure: 0.862
max_F1: 0.857
salient-object-detection-on-duts-te-1EVPv2
E-measure: 0.948
MAE: 0.027
Smeasure: 0.915
max_F1: 0.923
salient-object-detection-on-ecssd-1EVPv2
E-measure: 0.957
MAE: 0.028
S-measure: 0.935
max_F1: 0.958
salient-object-detection-on-pascal-s-1EVPv2
E-measure: 0.917
MAE: 0.053
S-measure: 0.879
max_F1: 0.869

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp