Command Palette
Search for a command to run...
Li Hongmin ; Ye Xiucai ; Imakura Akira ; Sakurai Tetsuya

Abstract
Spectral clustering is one of the most popular clustering methods. However,how to balance the efficiency and effectiveness of the large-scale spectralclustering with limited computing resources has not been properly solved for along time. In this paper, we propose a divide-and-conquer based large-scalespectral clustering method to strike a good balance between efficiency andeffectiveness. In the proposed method, a divide-and-conquer based landmarkselection algorithm and a novel approximate similarity matrix approach aredesigned to construct a sparse similarity matrix within low computationalcomplexities. Then clustering results can be computed quickly through abipartite graph partition process. The proposed method achieves a lowercomputational complexity than most existing large-scale spectral clusteringmethods. Experimental results on ten large-scale datasets have demonstrated theefficiency and effectiveness of the proposed method. The MATLAB code of theproposed method and experimental datasets are available athttps://github.com/Li-Hongmin/MyPaperWithCode.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-document-clustering-on-pendigits | U-SPEC | Accuracy (%): 81.68 NMI: 81.68 runtime (s): 2.07 |
| image-document-clustering-on-pendigits | LSC-R | Accuracy (%): 81.55 NMI: 79.15 runtime (s): 0.77 |
| image-document-clustering-on-pendigits | LSC-K | Accuracy (%): 74.02 NMI: 81.37 runtime (s): 1.20 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.