HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Representation Learning for Clustering via Building Consensus

Aniket Anand Deshmukh Jayanth Reddy Regatti Eren Manavoglu Urun Dogan

Representation Learning for Clustering via Building Consensus

Abstract

In this paper, we focus on unsupervised representation learning for clustering of images. Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must be close in the representation space (exemplar consistency), and/or similar images must have similar cluster assignments (population consistency). We define an additional notion of consistency, consensus consistency, which ensures that representations are learned to induce similar partitions for variations in the representation space, different clustering algorithms or different initializations of a single clustering algorithm. We define a clustering loss by executing variations in the representation space and seamlessly integrate all three consistencies (consensus, exemplar and population) into an end-to-end learning framework. The proposed algorithm, consensus clustering using unsupervised representation learning (ConCURL), improves upon the clustering performance of state-of-the-art methods on four out of five image datasets. Furthermore, we extend the evaluation procedure for clustering to reflect the challenges encountered in real-world clustering tasks, such as maintaining clustering performance in cases with distribution shifts. We also perform a detailed ablation study for a deeper understanding of the proposed algorithm. The code and the trained models are available at https://github.com/JayanthRR/ConCURL_NCE.

Code Repositories

JayanthRR/ConCURL_NCE
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-clustering-on-cifar-10ConCURL
ARI: 0.715
Accuracy: 0.846
NMI: 0.762
Train set: Train
image-clustering-on-cifar-100ConCURL
ARI: 0.303
Accuracy: 0.479
NMI: 0.468
Train Set: Train
image-clustering-on-imagenet-10ConCURL
ARI: 0.909
Accuracy: 0.958
NMI: 0.907
image-clustering-on-imagenet-dog-15ConCURL
ARI: 0.531
Accuracy: 0.695
NMI: 0.63
image-clustering-on-stl-10ConCURL
Accuracy: 0.749
NMI: 0.636
Train Split: Train+Test

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp