HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Shifting More Attention to Video Salient Object Detection

{ Jianbing Shen Ming-Ming Cheng Wenguan Wang Deng-Ping Fan}

Shifting More Attention to Video Salient Object Detection

Abstract

The last decade has witnessed a growing interest in video salient object detection (VSOD). However, the research community long-term lacked a well-established VSOD dataset representative of real dynamic scenes with high-quality annotations. To address this issue, we elaborately collected a visual-attention-consistent Densely Annotated VSOD (DAVSOD) dataset, which contains 226 videos with 23,938 frames that cover diverse realistic-scenes, objects, instances and motions. With corresponding real human eye-fixation data, we obtain precise ground-truths. This is the first work that explicitly emphasizes the challenge of saliency shift, i.e., the video salient object(s) may dynamically change. To further contribute the community a complete benchmark, we systematically assess 17 representative VSOD algorithms over seven existing VSOD datasets and our DAVSOD with totally 84K frames (largest-scale). Utilizing three famous metrics, we then present a comprehensive and insightful performance analysis. Furthermore, we propose a baseline model. It is equipped with a saliency shift- aware convLSTM, which can efficiently capture video saliency dynamics through learning human attention-shift behavior. Extensive experiments open up promising future directions for model development and comparison.

Benchmarks

BenchmarkMethodologyMetrics
video-salient-object-detection-on-davis-2016SSAV
AVERAGE MAE: 0.028
MAX E-MEASURE: 0.948
MAX F-MEASURE: 0.861
S-Measure: 0.893
video-salient-object-detection-on-davsodSSAV
Average MAE: 0.084
S-Measure: 0.755
max E-Measure: 0.806
max F-Measure: 0.659
video-salient-object-detection-on-davsod-1SSAV
Average MAE: 0.117
S-Measure: 0.661
max E-measure: 0.723
video-salient-object-detection-on-davsod-2SSAV
Average MAE: 0.114
S-Measure: 0.619
max E-measure: 0.696
video-salient-object-detection-on-fbms-59SSAV
AVERAGE MAE: 0.040
MAX E-MEASURE: 0.926
MAX F-MEASURE: 0.865
S-Measure: 0.879
video-salient-object-detection-on-mclSSAV
AVERAGE MAE: 0.026
MAX E-MEASURE: 0.889
MAX F-MEASURE: 0.773
S-Measure: 0.819
video-salient-object-detection-on-segtrack-v2SSAV
AVERAGE MAE: 0.023
MAX F-MEASURE: 0.801
S-Measure: 0.850
max E-measure: 0.917
video-salient-object-detection-on-uvsdSSAV
Average MAE: 0.025
S-Measure: 0.860
max E-measure: 0.939
video-salient-object-detection-on-visalSSAV
Average MAE: 0.021
S-Measure: 0.942
max E-measure: 0.980
video-salient-object-detection-on-vos-tSSAV
Average MAE: 0.074
S-Measure: 0.819
max E-measure: 0.839

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp