Command Palette
Search for a command to run...
{ Jianbing Shen Ming-Ming Cheng Wenguan Wang Deng-Ping Fan}

Abstract
The last decade has witnessed a growing interest in video salient object detection (VSOD). However, the research community long-term lacked a well-established VSOD dataset representative of real dynamic scenes with high-quality annotations. To address this issue, we elaborately collected a visual-attention-consistent Densely Annotated VSOD (DAVSOD) dataset, which contains 226 videos with 23,938 frames that cover diverse realistic-scenes, objects, instances and motions. With corresponding real human eye-fixation data, we obtain precise ground-truths. This is the first work that explicitly emphasizes the challenge of saliency shift, i.e., the video salient object(s) may dynamically change. To further contribute the community a complete benchmark, we systematically assess 17 representative VSOD algorithms over seven existing VSOD datasets and our DAVSOD with totally 84K frames (largest-scale). Utilizing three famous metrics, we then present a comprehensive and insightful performance analysis. Furthermore, we propose a baseline model. It is equipped with a saliency shift- aware convLSTM, which can efficiently capture video saliency dynamics through learning human attention-shift behavior. Extensive experiments open up promising future directions for model development and comparison.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-salient-object-detection-on-davis-2016 | SSAV | AVERAGE MAE: 0.028 MAX E-MEASURE: 0.948 MAX F-MEASURE: 0.861 S-Measure: 0.893 |
| video-salient-object-detection-on-davsod | SSAV | Average MAE: 0.084 S-Measure: 0.755 max E-Measure: 0.806 max F-Measure: 0.659 |
| video-salient-object-detection-on-davsod-1 | SSAV | Average MAE: 0.117 S-Measure: 0.661 max E-measure: 0.723 |
| video-salient-object-detection-on-davsod-2 | SSAV | Average MAE: 0.114 S-Measure: 0.619 max E-measure: 0.696 |
| video-salient-object-detection-on-fbms-59 | SSAV | AVERAGE MAE: 0.040 MAX E-MEASURE: 0.926 MAX F-MEASURE: 0.865 S-Measure: 0.879 |
| video-salient-object-detection-on-mcl | SSAV | AVERAGE MAE: 0.026 MAX E-MEASURE: 0.889 MAX F-MEASURE: 0.773 S-Measure: 0.819 |
| video-salient-object-detection-on-segtrack-v2 | SSAV | AVERAGE MAE: 0.023 MAX F-MEASURE: 0.801 S-Measure: 0.850 max E-measure: 0.917 |
| video-salient-object-detection-on-uvsd | SSAV | Average MAE: 0.025 S-Measure: 0.860 max E-measure: 0.939 |
| video-salient-object-detection-on-visal | SSAV | Average MAE: 0.021 S-Measure: 0.942 max E-measure: 0.980 |
| video-salient-object-detection-on-vos-t | SSAV | Average MAE: 0.074 S-Measure: 0.819 max E-measure: 0.839 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.