4 months ago

FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

Paul Voigtlaender; Yuning Chai; Florian Schroff; Hartwig Adam; Bastian Leibe; Liang-Chieh Chen

Abstract

Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use. In this work, we propose FEELVOS as a simple and fast method which does not rely on fine-tuning. In order to segment a video, for each frame FEELVOS uses a semantic pixel-wise embedding together with a global and a local matching mechanism to transfer information from the first frame and from the previous frame of the video to the current frame. In contrast to previous work, our embedding is only used as an internal guidance of a convolutional network. Our novel dynamic segmentation head allows us to train the network, including the embedding, end-to-end for the multiple object segmentation task with a cross entropy loss. We achieve a new state of the art in video object segmentation without fine-tuning with a J&F measure of 71.5% on the DAVIS 2017 validation set. We make our code and models available at https://github.com/tensorflow/models/tree/master/research/feelvos.
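The global matching step described in the abstract can be illustrated with a small sketch: for every pixel of the current frame, find the distance to the nearest first-frame pixel that belongs to the object. This is a minimal illustration under stated assumptions, not the released implementation. The function names, the flat per-pixel list layout, and the specific distance function 1 - 2 / (1 + exp(squared L2 distance)) are assumptions of this sketch; the abstract does not specify them.

```python
import math


def embedding_distance(e_p, e_q):
    """Distance in [0, 1) between two embedding vectors.

    Assumed form: 1 - 2 / (1 + exp(||e_p - e_q||^2)).
    This equals tanh(||e_p - e_q||^2 / 2), which avoids overflow in exp().
    """
    squared = sum((a - b) ** 2 for a, b in zip(e_p, e_q))
    return math.tanh(squared / 2.0)


def global_matching(current, reference, reference_mask):
    """Nearest-neighbour distance from each current pixel to the object.

    current:        embedding vectors, one per pixel of the current frame
    reference:      embedding vectors, one per pixel of the reference frame
    reference_mask: booleans, True where the reference pixel is on the object
    """
    object_embeddings = [e for e, m in zip(reference, reference_mask) if m]
    if not object_embeddings:
        # No object pixels in the reference frame: report maximal distance.
        return [1.0] * len(current)
    return [
        min(embedding_distance(p, q) for q in object_embeddings)
        for p in current
    ]


# Toy example with 2-D embeddings (hypothetical values).
first_frame = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
first_mask = [True, True, False]
current_frame = [(0.1, 0.0), (5.0, 5.1)]
distances = global_matching(current_frame, first_frame, first_mask)
```

A small distance suggests the pixel resembles the object in the first frame. In this sketch the same function could serve as local matching by passing the previous frame and its predicted mask as the reference and restricting the candidates to a spatial neighbourhood. As the abstract notes, such distance maps only guide the convolutional network and are not the final segmentation.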

Benchmarks

| Benchmark | Methodology | Metrics |
|---|---|---|
| semi-supervised-video-object-segmentation-on-1 | FEELVOS | F-measure (Decay): 33.5; F-measure (Mean): 60.4; F-measure (Recall): 68.5; J&F: 57.8; Jaccard (Decay): 29.8; Jaccard (Mean): 55.1; Jaccard (Recall): 62.6 |
| semi-supervised-video-object-segmentation-on-20 | FEELVOS | D16 val (F): 83.1; D16 val (G): 81.7; D16 val (J): 80.3; D17 test (F): 57.5; D17 test (G): 54.4; D17 test (J): 51.2; D17 val (F): 72.3; D17 val (G): 69.1; D17 val (J): 65.9; FPS: 2.22 |
| video-object-segmentation-on-youtube | FEELVOS | mIoU: 0.821 |
| visual-object-tracking-on-davis-2016 | FEELVOS | F-measure (Decay): 14.1; F-measure (Mean): 82.2; F-measure (Recall): 86.6; J&F: 81.65; Jaccard (Decay): 13.7; Jaccard (Mean): 81.1; Jaccard (Recall): 90.5 |
| visual-object-tracking-on-davis-2017 | FEELVOS | F-measure (Decay): 20.1; F-measure (Mean): 74.0; F-measure (Recall): 83.8; J&F: 71.55; Jaccard (Decay): 17.5; Jaccard (Mean): 69.1; Jaccard (Recall): 79.1 |
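
The combined J&F score in the benchmark figures above is the average of the Jaccard (Mean) and F-measure (Mean) values, following the usual DAVIS convention. The short check below reproduces the DAVIS 2016 and DAVIS 2017 entries from the listed means; the function name `j_and_f` is a hypothetical helper introduced only for this illustration.

```python
def j_and_f(j_mean, f_mean):
    """Average of region similarity (J mean) and contour accuracy (F mean)."""
    return (j_mean + f_mean) / 2.0


# Means taken from the DAVIS 2017 and DAVIS 2016 benchmark entries.
davis_2017 = j_and_f(69.1, 74.0)
davis_2016 = j_and_f(81.1, 82.2)

print(f"DAVIS 2017 J&F: {davis_2017:.2f}")
print(f"DAVIS 2016 J&F: {davis_2016:.2f}")
```

The same relation holds for the G columns of the other benchmark entry, where G is the average of the J and F values for each dataset split.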
