HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

Chanyoung Kim; Woojung Han; Dayun Ju; Seong Jae Hwang

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

Abstract

Semantic segmentation has innately relied on extensive pixel-level annotated data, leading to the emergence of unsupervised methodologies. Among them, leveraging self-supervised Vision Transformers for unsupervised semantic segmentation (USS) has been making steady progress with expressive deep features. Yet, for semantically segmenting images with complex objects, a predominant challenge remains: the lack of explicit object-level semantic encoding in patch-level features. This technical limitation often leads to inadequate segmentation of complex objects with diverse structures. To address this gap, we present a novel approach, EAGLE, which emphasizes object-centric representation learning for unsupervised semantic segmentation. Specifically, we introduce EiCue, a spectral technique providing semantic and structural cues through an eigenbasis derived from the semantic similarity matrix of deep image features and color affinity from an image. Further, by incorporating our object-centric contrastive loss with EiCue, we guide our model to learn object-level representations with intra- and inter-image object-feature consistency, thereby enhancing semantic accuracy. Extensive experiments on COCO-Stuff, Cityscapes, and Potsdam-3 datasets demonstrate the state-of-the-art USS results of EAGLE with accurate and consistent semantic segmentation across complex scenes.

Code Repositories

MICV-yonsei/EAGLE
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
unsupervised-semantic-segmentation-onEAGLE (DINO, ViT-S/8)
Accuracy: 81.8
mIoU: 19.7
unsupervised-semantic-segmentation-onEAGLE (DINO, ViT-B/8)
Accuracy: 79.4
mIoU: 22.1
unsupervised-semantic-segmentation-on-coco-7EAGLE (DINO, ViT-S/8)
Accuracy: 64.2
mIoU: 27.2
unsupervised-semantic-segmentation-on-potsdam-1EAGLE (DINO, ViT-B/8)
Accuracy: 83.3
mIoU: 71.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp