HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

Jiang Liu Hui Ding Zhaowei Cai Yuting Zhang Ravi Kumar Satzoda Vijay Mahadevan R. Manmatha

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

Abstract

In this work, instead of directly predicting the pixel-level segmentation masks, the problem of referring image segmentation is formulated as sequential polygon generation, and the predicted polygons can be later converted into segmentation masks. This is enabled by a new sequence-to-sequence framework, Polygon Transformer (PolyFormer), which takes a sequence of image patches and text query tokens as input, and outputs a sequence of polygon vertices autoregressively. For more accurate geometric localization, we propose a regression-based decoder, which predicts the precise floating-point coordinates directly, without any coordinate quantization error. In the experiments, PolyFormer outperforms the prior art by a clear margin, e.g., 5.40% and 4.52% absolute improvements on the challenging RefCOCO+ and RefCOCOg datasets. It also shows strong generalization ability when evaluated on the referring video segmentation task without fine-tuning, e.g., achieving competitive 61.5% J&F on the Ref-DAVIS17 dataset.

Code Repositories

amazon-science/polygon-transformer
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
referring-expression-segmentation-on-davisPolyFormer-B
Ju0026F 1st frame: 60.9
Zero-Shot Transfer: true
referring-expression-segmentation-on-refcocoPolyFormer-L
Mean IoU: 76.94
Overall IoU: 75.96
referring-expression-segmentation-on-refcocoPolyFormer-B
Overall IoU: 74.82
referring-expression-segmentation-on-refcoco-3PolyFormer-L
Mean IoU: 72.15
Overall IoU: 69.33
referring-expression-segmentation-on-refcoco-3PolyFormer-B
Mean IoU: 70.65
Overall IoU: 67.64
referring-expression-segmentation-on-refcoco-4PolyFormer-B
Mean IoU: 74.51
Overall IoU: 72.89
referring-expression-segmentation-on-refcoco-4PolyFormer-L
Mean IoU: 75.71
Overall IoU: 74.56
referring-expression-segmentation-on-refcoco-5PolyFormer-L
Mean IoU: 66.73
Overall IoU: 61.87
referring-expression-segmentation-on-refcoco-5PolyFormer-B
Mean IoU: 64.64
Overall IoU: 59.33
referring-expression-segmentation-on-refcocogPolyFormer-L
Mean IoU: 71.15
Overall IoU: 69.2
referring-expression-segmentation-on-refcocogPolyFormer-B
Mean IoU: 69.36
Overall IoU: 67.76
referring-expression-segmentation-on-refcocog-1PolyFormer-L
Mean IoU: 71.17
Overall IoU: 70.19
referring-expression-segmentation-on-refcocog-1PolyFormer-B
Mean IoU: 69.88
Overall IoU: 69.05
referring-expression-segmentation-on-referitPolyFormer-L
Mean IoU: 67.22
Overall IoU: 72.6
referring-expression-segmentation-on-referitPolyFormer-B
Mean IoU: 65.98
Overall IoU: 71.91

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp