Command Palette
Search for a command to run...
Patch-Depth Fusion: Dichotomous Image Segmentation via Fine-Grained Patch Strategy and Depth Integrity-Prior
Liu Xianjie Fu Keren Zhao Qijun

Abstract
Dichotomous Image Segmentation (DIS) is a high-precision object segmentationtask for high-resolution natural images. The current mainstream methods focuson the optimization of local details but overlook the fundamental challenge ofmodeling the integrity of objects. We have found that the depth integrity-priorimplicit in the the pseudo-depth maps generated by Depth Anything Model v2 andthe local detail features of image patches can jointly address the abovedilemmas. Based on the above findings, we have designed a novel Patch-DepthFusion Network (PDFNet) for high-precision dichotomous image segmentation. Thecore of PDFNet consists of three aspects. Firstly, the object perception isenhanced through multi-modal input fusion. By utilizing the patch fine-grainedstrategy, coupled with patch selection and enhancement, the sensitivity todetails is improved. Secondly, by leveraging the depth integrity-priordistributed in the depth maps, we propose an integrity-prior loss to enhancethe uniformity of the segmentation results in the depth maps. Finally, weutilize the features of the shared encoder and, through a simple depthrefinement decoder, improve the ability of the shared encoder to capture subtledepth-related information in the images. Experiments on the DIS-5K dataset showthat PDFNet significantly outperforms state-of-the-art non-diffusion methods.Due to the incorporation of the depth integrity-prior, PDFNet achieves or evensurpassing the performance of the latest diffusion-based methods while usingless than 11% of the parameters of diffusion-based methods. The source code athttps://github.com/Tennine2077/PDFNet
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| dichotomous-image-segmentation-on-dis-te1 | PDFNet | E-measure: 0.927 MAE: 0.031 S-Measure: 0.899 max F-Measure: 0.890 weighted F-measure: 0.846 |
| dichotomous-image-segmentation-on-dis-te2 | PDFNet | E-measure: 0.947 MAE: 0.028 S-Measure: 0.924 max F-Measure: 0.921 weighted F-measure: 0.885 |
| dichotomous-image-segmentation-on-dis-te3 | PDFNet | E-measure: 0.957 MAE: 0.027 S-Measure: 0.928 max F-Measure: 0.936 weighted F-measure: 0.900 |
| dichotomous-image-segmentation-on-dis-te4 | PDFNet | E-measure: 0.941 MAE: 0.037 S-Measure: 0.910 max F-Measure: 0.911 weighted F-measure: 0.867 |
| dichotomous-image-segmentation-on-dis-vd | PDFNet | E-measure: 0.944 MAE: 0.030 S-Measure: 0.916 max F-Measure: 0.913 weighted F-measure: 0.873 |
| rgb-salient-object-detection-on-hrsod | PDFNet (HRSOD,UHRSD) | MAE: 0.012 S-Measure: 0.963 max F-Measure: 0.965 |
| rgb-salient-object-detection-on-uhrsd | PDFNet (HRSOD, UHRSD) | MAE: 0.019 S-Measure: 0.953 max F-Measure: 0.963 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.