Command Palette
Search for a command to run...
Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection
{Zhongxuan Luo Huchuan Lu Jingjing Li Wei Ji Shunyu Yao Yongri Piao Yifei Wang Jie Liu Miao Zhang}

Abstract
The ability to capture inter-frame dynamics has been critical to the development of video salient object detection (VSOD). While many works have achieved great success in this field, a deeper insight into its dynamic nature should be developed. In this work, we aim to answer the following questions: How can a model adjust itself to dynamic variations as well as perceive fine differences in the real-world environment; How are the temporal dynamics well introduced into spatial information over time? To this end, we propose a dynamic context-sensitive filtering network (DCFNet) equipped with a dynamic context-sensitive filtering module (DCFM) and an effective bidirectional dynamic fusion strategy. The proposed DCFM sheds new light on dynamic filter generation by extracting location-related affinities between consecutive frames. Our bidirectional dynamic fusion strategy encourages the interaction of spatial and temporal information in a dynamic manner. Experimental results demonstrate that our proposed method can achieve state-of-the-art performance on most VSOD datasets while ensuring a real-time speed of 28 fps. The source code is publicly available at https://github.com/OIPLab-DUT/DCFNet.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-polyp-segmentation-on-sun-seg-easy | DCF | Dice: 0.325 S measure: 0.523 Sensitivity: 0.340 mean E-measure: 0.514 mean F-measure: 0.312 weighted F-measure: 0.270 |
| video-polyp-segmentation-on-sun-seg-hard | DCF | Dice: 0.317 S-Measure: 0.514 Sensitivity: 0.364 mean E-measure: 0.522 mean F-measure: 0.303 weighted F-measure: 0.263 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.