Command Palette
Search for a command to run...
Liu Weihuang ; Shen Xi ; Pun Chi-Man ; Cun Xiaodong

Abstract
Foreground segmentation is a fundamental problem in computer vision, whichincludes salient object detection, forgery detection, defocus blur detection,shadow detection, and camouflage object detection. Previous works havetypically relied on domain-specific solutions to address accuracy androbustness issues in those applications. In this paper, we present a unifiedframework for a number of foreground segmentation tasks without anytask-specific designs. We take inspiration from the widely-used pre-trainingand then prompt tuning protocols in NLP and propose a new visual promptingmodel, named Explicit Visual Prompting (EVP). Different from the previousvisual prompting which is typically a dataset-level implicit embedding, our keyinsight is to enforce the tunable parameters focusing on the explicit visualcontent from each individual image, i.e., the features from frozen patchembeddings and high-frequency components. Our method freezes a pre-trainedmodel and then learns task-specific knowledge using a few extra parameters.Despite introducing only a small number of tunable parameters, EVP achievessuperior performance than full fine-tuning and other parameter-efficientfine-tuning methods. Experiments in fourteen datasets across five tasks showthe proposed method outperforms other task-specific methods while beingconsiderably simple. The proposed method demonstrates the scalability indifferent architectures, pre-trained weights, and tasks. The code is availableat: https://github.com/NiFangBaAGe/Explicit-Visual-Prompt.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| camouflaged-object-segmentation-on-camo | EVPv2 | MAE: 0.058 S-Measure: 0.848 Weighted F-Measure: 0.786 |
| camouflaged-object-segmentation-on-cod | EVPv2 | MAE: 0.029 S-Measure: 0.843 Weighted F-Measure: 0.746 |
| salient-object-detection-on-dut-omron-2 | EVPv2 | E-measure: 0.895 MAE: 0.047 S-measure: 0.862 max_F1: 0.857 |
| salient-object-detection-on-duts-te-1 | EVPv2 | E-measure: 0.948 MAE: 0.027 Smeasure: 0.915 max_F1: 0.923 |
| salient-object-detection-on-ecssd-1 | EVPv2 | E-measure: 0.957 MAE: 0.028 S-measure: 0.935 max_F1: 0.958 |
| salient-object-detection-on-pascal-s-1 | EVPv2 | E-measure: 0.917 MAE: 0.053 S-measure: 0.879 max_F1: 0.869 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.