Command Palette
Search for a command to run...
Michał Znaleźniak Przemysław Rola Patryk Kaszuba Jacek Tabor Marek Śmieja

Abstract
Deep clustering has been dominated by flat models, which split a dataset into a predefined number of groups. Although recent methods achieve an extremely high similarity with the ground truth on popular benchmarks, the information contained in the flat partition is limited. In this paper, we introduce CoHiClust, a Contrastive Hierarchical Clustering model based on deep neural networks, which can be applied to typical image data. By employing a self-supervised learning approach, CoHiClust distills the base network into a binary tree without access to any labeled data. The hierarchical clustering structure can be used to analyze the relationship between clusters, as well as to measure the similarity between data points. Experiments demonstrate that CoHiClust generates a reasonable structure of clusters, which is consistent with our intuition and image semantics. Moreover, it obtains superior clustering accuracy on most of the image datasets compared to the state-of-the-art flat clustering models.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-clustering-on-cifar-10 | CoHiClust | ARI: 0.731 Accuracy: 0.839 Backbone: ResNet-50 NMI: 0.779 Train set: Train |
| image-clustering-on-cifar-100 | CoHiClust | ARI: 0.299 Accuracy: 0.437 NMI: 0.467 |
| image-clustering-on-fashion-mnist | CoHiClust | Accuracy: 0.65 |
| image-clustering-on-imagenet-10 | CoHiClust | ARI: 0.899 Accuracy: 0.953 Backbone: ResNet-50 NMI: 0.907 |
| image-clustering-on-imagenet-dog-15 | CoHiClust | ARI: 0.232 Accuracy: 0.355 Backbone: ResNet-50 NMI: 0.411 |
| image-clustering-on-mnist | CoHiClust | Accuracy: 0.99 |
| image-clustering-on-stl-10 | CoHiClust | ARI: 0.474 Accuracy: 0.613 Backbone: ResNet-50 NMI: 0.584 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.