Command Palette
Search for a command to run...
Varga Viktor ; Lőrincz András

Abstract
Pixelwise annotation of image sequences can be very tedious for humans.Interactive video object segmentation aims to utilize automatic methods tospeed up the process and reduce the workload of the annotators. Mostcontemporary approaches rely on deep convolutional networks to collect andprocess information from human annotations throughout the video. However, suchnetworks contain millions of parameters and need huge amounts of labeledtraining data to avoid overfitting. Beyond that, label propagation is usuallyexecuted as a series of frame-by-frame inference steps, which is difficult tobe parallelized and is thus time consuming. In this paper we present a graphneural network based approach for tackling the problem of interactive videoobject segmentation. Our network operates on superpixel-graphs which allow usto reduce the dimensionality of the problem by several magnitudes. We show,that our network possessing only a few thousand parameters is able to achievestate-of-the-art performance, while inference remains fast and can be trainedquickly with very little data.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| interactive-video-object-segmentation-on | GNNannot | AUC-J: 0.759 AUC-Ju0026F: 0.782 Ju0026F@60s: 0.790 J@60s: 0.767 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.