HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation

Miriam Bellver Carles Ventura Carina Silberer Ioannis Kazakos Jordi Torres Xavier Giro-i-Nieto

RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation

Abstract

The task of video object segmentation with referring expressions (language-guided VOS) is to, given a linguistic phrase and a video, generate binary masks for the object to which the phrase refers. Our work argues that existing benchmarks used for this task are mainly composed of trivial cases, in which referents can be identified with simple phrases. Our analysis relies on a new categorization of the phrases in the DAVIS-2017 and Actor-Action datasets into trivial and non-trivial REs, with the non-trivial REs annotated with seven RE semantic categories. We leverage this data to analyze the results of RefVOS, a novel neural network that obtains competitive results for the task of language-guided image segmentation and state of the art results for language-guided VOS. Our study indicates that the major challenges for the task are related to understanding motion and static actions.

Code Repositories

miriambellver/refvos
Official
pytorch
Mentioned in GitHub
imatge-upc/refvos
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
referring-expression-segmentation-on-a2dRefVOS
IoU mean: 0.599
IoU overall: 0.599
Precision@0.5: 0.495
Precision@0.9: 0.064
referring-expression-segmentation-on-a2dreRefVos
Mean IoU: 33.2
Overall IoU: 47.5
referring-expression-segmentation-on-davisRefVOS
Ju0026F 1st frame: 44.5
Ju0026F Full video: 45.1
referring-expression-segmentation-on-refcocoRefVOS with BERT + MLM loss
Overall IoU: 59.45
referring-expression-segmentation-on-refcocoRefVOS with BERT Pre-train
Overall IoU: 58.65
referring-expression-segmentation-on-refcoco-3RefVOS with BERT + MLM loss
Overall IoU: 44.71
referring-expression-segmentation-on-refcoco-4RefVOS with BERT + MLM Loss
Overall IoU: 49.73
referring-expression-segmentation-on-refcoco-5RefVOS with BERT + MLM loss
Overall IoU: 36.17

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp