HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Twin Contrastive Learning for Online Clustering

Li Yunfan ; Yang Mouxing ; Peng Dezhong ; Li Taihao ; Huang Jiantao ; Peng Xi

Twin Contrastive Learning for Online Clustering

Abstract

This paper proposes to perform online clustering by conducting twincontrastive learning (TCL) at the instance and cluster level. Specifically, wefind that when the data is projected into a feature space with a dimensionalityof the target cluster number, the rows and columns of its feature matrixcorrespond to the instance and cluster representation, respectively. Based onthe observation, for a given dataset, the proposed TCL first constructspositive and negative pairs through data augmentations. Thereafter, in the rowand column space of the feature matrix, instance- and cluster-level contrastivelearning are respectively conducted by pulling together positive pairs whilepushing apart the negatives. To alleviate the influence of intrinsicfalse-negative pairs and rectify cluster assignments, we adopt aconfidence-based criterion to select pseudo-labels for boosting both theinstance- and cluster-level contrastive learning. As a result, the clusteringperformance is further improved. Besides the elegant idea of twin contrastivelearning, another advantage of TCL is that it could independently predict thecluster assignment for each instance, thus effortlessly fitting onlinescenarios. Extensive experiments on six widely-used image and text benchmarksdemonstrate the effectiveness of TCL. The code will be released on GitHub.

Benchmarks

BenchmarkMethodologyMetrics
image-clustering-on-cifar-10TCL
ARI: 0.780
Accuracy: 0.887
Backbone: ResNet-34
NMI: 0.819
Train set: Train
image-clustering-on-cifar-100TCL
ARI: 0.357
Accuracy: 0.531
NMI: 0.529
Train Set: Train
image-clustering-on-imagenet-10TCL
ARI: 0.837
Accuracy: 0.895
NMI: 0.875
image-clustering-on-imagenet-dog-15TCL
ARI: 0.516
Accuracy: 0.644
NMI: 0.623
image-clustering-on-stl-10TCL
ARI: 0.757
Accuracy: 0.868
Backbone: ResNet-34
NMI: 0.799
Train Split: Train
short-text-clustering-on-biomedicalTCL
Acc: 49.8
NMI: 42.9
short-text-clustering-on-stackoverflowTCL
Acc: 88.2
NMI: 0.786

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp