Command Palette
Search for a command to run...
Yunfan Li Peng Hu Dezhong Peng Jiancheng Lv Jianping Fan Xi Peng

Abstract
The core of clustering is incorporating prior knowledge to construct supervision signals. From classic k-means based on data compactness to recent contrastive clustering guided by self-supervision, the evolution of clustering methods intrinsically corresponds to the progression of supervision signals. At present, substantial efforts have been devoted to mining internal supervision signals from data. Nevertheless, the abundant external knowledge such as semantic descriptions, which naturally conduces to clustering, is regrettably overlooked. In this work, we propose leveraging external knowledge as a new supervision signal to guide clustering, even though it seems irrelevant to the given data. To implement and validate our idea, we design an externally guided clustering method (Text-Aided Clustering, TAC), which leverages the textual semantics of WordNet to facilitate image clustering. Specifically, TAC first selects and retrieves WordNet nouns that best distinguish images to enhance the feature discriminability. Then, to improve image clustering performance, TAC collaborates text and image modalities by mutually distilling cross-modal neighborhood information. Experiments demonstrate that TAC achieves state-of-the-art performance on five widely used and three more challenging image clustering benchmarks, including the full ImageNet-1K dataset.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-imagenet-10 | TAC | ARI: 0.983 |
| image-clustering-on-cifar-10 | TAC | ARI: 0.831 Accuracy: 0.919 NMI: 0.833 |
| image-clustering-on-cifar-20 | TAC | ARI: 0.448 Accuracy: 0.607 NMI: 0.611 |
| image-clustering-on-dtd | TAC | ARI: 34.4 Accuracy: 50.1 NMI: 62.1 |
| image-clustering-on-imagenet-10 | TAC | Accuracy: 0.992 NMI: 0.985 |
| image-clustering-on-imagenet-1k | TAC | ARI: 0.435 Accuracy: 0.582 NMI: 0.799 |
| image-clustering-on-stl-10 | TAC | ARI: 0.961 Accuracy: 0.982 NMI: 0.955 |
| image-clustering-on-ucf101 | TAC | ARI: 0.601 Accuracy: 0.687 NMI: 0.823 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.