HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Hierarchical Open-vocabulary Universal Image Segmentation

Wang Xudong ; Li Shufan ; Kallidromitis Konstantinos ; Kato Yusuke ; Kozuka Kazuki ; Darrell Trevor

Hierarchical Open-vocabulary Universal Image Segmentation

Abstract

Open-vocabulary image segmentation aims to partition an image into semanticregions according to arbitrary text descriptions. However, complex visualscenes can be naturally decomposed into simpler parts and abstracted atmultiple levels of granularity, introducing inherent segmentation ambiguity.Unlike existing methods that typically sidestep this ambiguity and treat it asan external factor, our approach actively incorporates a hierarchicalrepresentation encompassing different semantic-levels into the learningprocess. We propose a decoupled text-image fusion mechanism and representationlearning modules for both "things" and "stuff". Additionally, we systematicallyexamine the differences that exist in the textual and visual features betweenthese types of categories. Our resulting model, named HIPIE, tacklesHIerarchical, oPen-vocabulary, and unIvErsal segmentation tasks within aunified framework. Benchmarked on over 40 datasets, e.g., ADE20K, COCO,Pascal-VOC Part, RefCOCO/RefCOCOg, ODinW and SeginW, HIPIE achieves thestate-of-the-art results at various levels of image comprehension, includingsemantic-level (e.g., semantic segmentation), instance-level (e.g.,panoptic/referring segmentation and object detection), as well as part-level(e.g., part/subpart segmentation) tasks. Our code is released athttps://github.com/berkeley-hipie/HIPIE.

Code Repositories

berkeley-hipie/hipie
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-segmentation-on-pascal-panoptic-partsHIPIE (ResNet-50)
mIoUPartS: 57.2
image-segmentation-on-pascal-panoptic-partsHIPIE (ViT-H)
mIoUPartS: 63.8
panoptic-segmentation-on-coco-minivalHIPIE (ViT-H, single-scale)
PQ: 58.1
mIoU: 66.8
referring-expression-segmentation-on-refcocoHIPIE
Overall IoU: 82.8
referring-expression-segmentation-on-refcoco-3HIPIE
Overall IoU: 73.9
zero-shot-segmentation-on-segmentation-in-theHIPIE
Mean AP: 41.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp