

Large-scale spectral clustering using diffusion coordinates on landmark-based bipartite graphs

Khiem Pham, Guangliang Chen


Abstract

Spectral clustering has received considerable attention because it can separate nonconvex, non-intersecting manifolds, but its high computational complexity has significantly limited its applicability. Motivated by the document-term co-clustering framework of Dhillon (2001), we propose a scalable, landmark-based spectral clustering approach: we first use a selected landmark set and the given data to form a bipartite graph, and then run a diffusion process on it to obtain a family of diffusion coordinates for clustering. We show that the proposed algorithm can be implemented with very efficient operations on the affinity matrix between the given data and the selected landmarks, and is therefore capable of handling large data sets. Finally, we demonstrate the excellent performance of our method by comparing it with state-of-the-art scalable algorithms on several benchmark data sets.
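The pipeline outlined in the abstract (landmarks, bipartite affinity matrix, diffusion coordinates) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes uniformly random landmark selection, a Gaussian kernel with bandwidth `sigma`, the degree normalization used in Dhillon-style co-clustering, and coordinates scaled by singular values raised to a diffusion time `t`. The function name and all parameters are hypothetical.

```python
import numpy as np


def landmark_diffusion_coords(X, n_landmarks, k, sigma=1.0, t=1, seed=0):
    """Sketch: diffusion-style coordinates from a data-landmark bipartite graph.

    Hypothetical helper, not the published method.
    """
    rng = np.random.default_rng(seed)

    # Assumption: landmarks are a uniform random subset of the data.
    idx = rng.choice(X.shape[0], size=n_landmarks, replace=False)
    landmarks = X[idx]

    # Gaussian affinity between every point and every landmark (n x m).
    # Builds an n x m x d array, which is fine for a small illustration.
    sq_dist = ((X[:, None, :] - landmarks[None, :, :]) ** 2).sum(axis=2)
    A = np.exp(-sq_dist / (2.0 * sigma ** 2))

    # Degree normalization of the bipartite graph.
    d1 = A.sum(axis=1)  # data-side degrees
    d2 = A.sum(axis=0)  # landmark-side degrees
    A_norm = A / np.sqrt(d1)[:, None] / np.sqrt(d2)[None, :]

    # Thin SVD of the n x m matrix; no n x n affinity matrix is ever formed.
    U, s, _ = np.linalg.svd(A_norm, full_matrices=False)

    # Skip the trivial leading singular vector, undo the degree scaling,
    # and weight each remaining coordinate by its singular value to the power t.
    return (U[:, 1:k] / np.sqrt(d1)[:, None]) * s[1:k] ** t
```

The rows of the returned array can then be passed to any k-means routine to obtain cluster labels. Only the n-by-m affinity matrix is stored and decomposed, which is where the scalability of a landmark-based approach comes from.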

Benchmarks

| Benchmark | Methodology | Accuracy (%) | Runtime (s) |
| --- | --- | --- | --- |
| image-document-clustering-on-pendigits | LBDM | 74.70 | 3.08 |
