HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Video Object Segmentation using Space-Time Memory Networks

Oh Seoung Wug ; Lee Joon-Young ; Xu Ning ; Kim Seon Joo

Video Object Segmentation using Space-Time Memory Networks

Abstract

We propose a novel solution for semi-supervised video object segmentation. Bythe nature of the problem, available cues (e.g. video frame(s) with objectmasks) become richer with the intermediate predictions. However, the existingmethods are unable to fully exploit this rich source of information. We resolvethe issue by leveraging memory networks and learn to read relevant informationfrom all available sources. In our framework, the past frames with object masksform an external memory, and the current frame as the query is segmented usingthe mask information in the memory. Specifically, the query and the memory aredensely matched in the feature space, covering all the space-time pixellocations in a feed-forward fashion. Contrast to the previous approaches, theabundant use of the guidance information allows us to better handle thechallenges such as appearance changes and occlussions. We validate our methodon the latest benchmark sets and achieved the state-of-the-art performance(overall score of 79.4 on Youtube-VOS val set, J of 88.7 and 79.2 on DAVIS2016/2017 val set respectively) while having a fast runtime (0.16 second/frameon DAVIS 2016 val set).

Code Repositories

seoungwugoh/STM
pytorch
Mentioned in GitHub
hkchengrex/Mask-Propagation
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
interactive-video-object-segmentation-onSTM
AUC-Ju0026F: 0.803
Ju0026F@60s: 0.848
semi-supervised-video-object-segmentation-on-1STM
F-measure (Decay): 17.5
F-measure (Mean): 75.2
F-measure (Recall): 83.0
Ju0026F: 72.2
Jaccard (Decay): 16.9
Jaccard (Mean): 69.3
Jaccard (Recall): 78.0
semi-supervised-video-object-segmentation-on-20STM
D16 val (F): 88.1
D16 val (G): 86.5
D16 val (J): 84.8
D17 val (F): 74.0
D17 val (G): 71.6
D17 val (J): 69.2
FPS: 6.25
video-object-segmentation-on-youtube-vosSTM
Overall: 68.2
visual-object-tracking-on-davis-2016STM
F-measure (Decay): 4.2
F-measure (Mean): 90.1
F-measure (Recall): 95.2
Ju0026F: 89.4
Jaccard (Decay): 5.0
Jaccard (Mean): 88.7
Jaccard (Recall): 97.4
visual-object-tracking-on-davis-2017STM
F-measure (Decay): 10.5
F-measure (Mean): 84.3
F-measure (Recall): 91.8
Ju0026F: 81.75
Jaccard (Decay): 8.0
Jaccard (Mean): 79.2
Jaccard (Recall): 88.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp