HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Efficient Video Object Segmentation via Network Modulation

Linjie Yang; Yanran Wang; Xuehan Xiong; Jianchao Yang; Aggelos K. Katsaggelos

Efficient Video Object Segmentation via Network Modulation

Abstract

Video object segmentation targets at segmenting a specific object throughout a video sequence, given only an annotated first frame. Recent deep learning based approaches find it effective by fine-tuning a general-purpose segmentation model on the annotated frame using hundreds of iterations of gradient descent. Despite the high accuracy these methods achieve, the fine-tuning process is inefficient and fail to meet the requirements of real world applications. We propose a novel approach that uses a single forward pass to adapt the segmentation model to the appearance of a specific object. Specifically, a second meta neural network named modulator is learned to manipulate the intermediate layers of the segmentation network given limited visual and spatial information of the target object. The experiments show that our approach is 70times faster than fine-tuning approaches while achieving similar accuracy.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
one-shot-visual-object-segmentation-onOSMN
Jaccard (Seen): 60.0
semi-supervised-video-object-segmentation-on-1OSMN
F-measure (Decay): 17.4
F-measure (Recall): 47.4
Ju0026F: 41.3
Jaccard (Decay): 19.0
Jaccard (Mean): 37.7
Jaccard (Recall): 38.9
video-instance-segmentation-on-youtube-vis-1OSMN
AP50: 28.6
AP75: 33.1
mask AP: 29.1
video-object-segmentation-on-youtube-vosOSMN
F-Measure (Seen): 60.1
F-Measure (Unseen): 44.0
Jaccard (Seen): 60.0
Jaccard (Unseen): 40.6
Overall: 51.2
Speed (FPS): 7.14
visual-object-tracking-on-davis-2016OSMN
F-measure (Decay): 10.6
F-measure (Mean): 72.9
F-measure (Recall): 84.0
Ju0026F: 73.45
Jaccard (Decay): 9.0
Jaccard (Mean): 74.0
Jaccard (Recall): 87.6
visual-object-tracking-on-davis-2017OSMN
F-measure (Decay): 24.3
F-measure (Mean): 57.1
F-measure (Recall): 66.1
Ju0026F: 54.8
Jaccard (Decay): 21.5
Jaccard (Mean): 52.5
Jaccard (Recall): 60.9
visual-object-tracking-on-youtube-vosOSMN
F-Measure (Seen): 60.1
F-Measure (Unseen): 44.0
Jaccard (Seen): 60.0
O (Average of Measures): 51.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp