
Learning Position and Target Consistency for Memory-based Video Object Segmentation

Li Hu; Peng Zhang; Bang Zhang; Pan Pan; Yinghui Xu; Rong Jin


Abstract

This paper studies the problem of semi-supervised video object segmentation (VOS). Multiple works have shown that memory-based approaches can be effective for video object segmentation. They are mostly based on pixel-level matching, both spatially and temporally. The main shortcoming of memory-based approaches is that they neither take into account the sequential order among frames nor exploit object-level knowledge of the target. To address this limitation, we propose a framework that Learns position and target Consistency for Memory-based video object segmentation, termed LCM. It applies the memory mechanism to retrieve pixels globally, while also learning position consistency for more reliable segmentation. The learned location response promotes better discrimination between the target and distractors. In addition, LCM introduces an object-level relationship from the target to maintain target consistency, making LCM more robust to error drift. Experiments show that LCM achieves state-of-the-art performance on both the DAVIS and YouTube-VOS benchmarks. We also ranked 1st in the semi-supervised VOS task of the DAVIS 2020 challenge.
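The global pixel retrieval mentioned in the abstract can be illustrated with a generic memory read, as commonly used in memory-based VOS. This is a minimal sketch and not LCM's actual implementation: the function name `memory_read`, the flattened array shapes, and the dot-product similarity with softmax weighting are all assumptions made for illustration. The position-consistency and target-consistency components are not modeled here.

```python
import numpy as np

def memory_read(query_key, memory_key, memory_value):
    """Generic pixel-level memory read (hypothetical helper, not from the paper).

    query_key:    (HW, C_k)   keys of the current frame's pixels
    memory_key:   (THW, C_k)  keys of all pixels in the memorized frames
    memory_value: (THW, C_v)  values stored for those memory pixels
    Returns an (HW, C_v) array: one retrieved value per query pixel.
    """
    query_key = np.asarray(query_key, dtype=float)
    memory_key = np.asarray(memory_key, dtype=float)
    memory_value = np.asarray(memory_value, dtype=float)

    # Dot-product similarity between every query pixel and every memory pixel.
    affinity = query_key @ memory_key.T              # (HW, THW)

    # Softmax over the memory axis, shifted by the row maximum for stability.
    affinity -= affinity.max(axis=1, keepdims=True)
    weights = np.exp(affinity)
    weights /= weights.sum(axis=1, keepdims=True)

    # Each query pixel retrieves a weighted sum of memory values.
    return weights @ memory_value                    # (HW, C_v)
```

Because the softmax runs over all memory pixels regardless of their frame index or location, a read like this has no notion of frame order or object identity, which is the shortcoming the abstract attributes to purely matching-based approaches.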

Benchmarks

Benchmark: semi-supervised-video-object-segmentation-on-20
Methodology: LCM
Metrics:
D17 val (F): 77.2
D17 val (G): 75.2
D17 val (J): 73.1
FPS: 8.47
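
For readers unfamiliar with the metric names, the sketch below assumes the usual DAVIS convention: J is region similarity (intersection over union of the predicted and ground-truth masks), F is contour accuracy, and G is the mean of J and F. The helper names are hypothetical, and treating two empty masks as a perfect match is likewise an assumption. Under this convention the listed G is consistent with the listed J and F up to rounding.

```python
import numpy as np

def region_similarity(pred_mask, gt_mask):
    """J for one frame: intersection over union of two binary masks."""
    pred = np.asarray(pred_mask, dtype=bool)
    gt = np.asarray(gt_mask, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Assumption: two empty masks count as a perfect match.
        return 1.0
    return float(np.logical_and(pred, gt).sum() / union)

def global_score(j_mean, f_mean):
    """G: average of mean region similarity J and mean contour accuracy F."""
    return (j_mean + f_mean) / 2.0

# Values from the benchmark entry above; the result is close to the listed G.
g = global_score(73.1, 77.2)
```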
