Command Palette
Search for a command to run...
Oh Seoung Wug ; Lee Joon-Young ; Xu Ning ; Kim Seon Joo

Abstract
We propose a novel solution for semi-supervised video object segmentation. Bythe nature of the problem, available cues (e.g. video frame(s) with objectmasks) become richer with the intermediate predictions. However, the existingmethods are unable to fully exploit this rich source of information. We resolvethe issue by leveraging memory networks and learn to read relevant informationfrom all available sources. In our framework, the past frames with object masksform an external memory, and the current frame as the query is segmented usingthe mask information in the memory. Specifically, the query and the memory aredensely matched in the feature space, covering all the space-time pixellocations in a feed-forward fashion. Contrast to the previous approaches, theabundant use of the guidance information allows us to better handle thechallenges such as appearance changes and occlussions. We validate our methodon the latest benchmark sets and achieved the state-of-the-art performance(overall score of 79.4 on Youtube-VOS val set, J of 88.7 and 79.2 on DAVIS2016/2017 val set respectively) while having a fast runtime (0.16 second/frameon DAVIS 2016 val set).
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| interactive-video-object-segmentation-on | STM | AUC-Ju0026F: 0.803 Ju0026F@60s: 0.848 |
| semi-supervised-video-object-segmentation-on-1 | STM | F-measure (Decay): 17.5 F-measure (Mean): 75.2 F-measure (Recall): 83.0 Ju0026F: 72.2 Jaccard (Decay): 16.9 Jaccard (Mean): 69.3 Jaccard (Recall): 78.0 |
| semi-supervised-video-object-segmentation-on-20 | STM | D16 val (F): 88.1 D16 val (G): 86.5 D16 val (J): 84.8 D17 val (F): 74.0 D17 val (G): 71.6 D17 val (J): 69.2 FPS: 6.25 |
| video-object-segmentation-on-youtube-vos | STM | Overall: 68.2 |
| visual-object-tracking-on-davis-2016 | STM | F-measure (Decay): 4.2 F-measure (Mean): 90.1 F-measure (Recall): 95.2 Ju0026F: 89.4 Jaccard (Decay): 5.0 Jaccard (Mean): 88.7 Jaccard (Recall): 97.4 |
| visual-object-tracking-on-davis-2017 | STM | F-measure (Decay): 10.5 F-measure (Mean): 84.3 F-measure (Recall): 91.8 Ju0026F: 81.75 Jaccard (Decay): 8.0 Jaccard (Mean): 79.2 Jaccard (Recall): 88.7 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.