Deep clustering: On the link between discriminative models and K-means

Mohammed Jabi; Marco Pedersoli; Amar Mitiche; Ismail Ben Ayed

Abstract

In the context of recent deep clustering studies, discriminative models dominate the literature and report the most competitive performance. These models learn a deep discriminative neural network classifier in which the labels are latent. Typically, they use multinomial logistic regression posteriors and parameter regularization, as is very common in supervised learning. It is generally acknowledged that discriminative objective functions (e.g., those based on the mutual information or the KL divergence) are more flexible than generative approaches (e.g., K-means) in the sense that they make fewer assumptions about the data distributions and, typically, yield much better unsupervised deep learning results. On the surface, several recent discriminative models may seem unrelated to K-means. This study shows that these models are, in fact, equivalent to K-means under mild conditions, for common posterior models and parameter regularization. We prove that, for the commonly used logistic regression posteriors, maximizing the $L_2$-regularized mutual information via an approximate alternating direction method (ADM) is equivalent to minimizing a soft and regularized K-means loss. Our theoretical analysis not only directly connects several recent state-of-the-art discriminative models to K-means, but also leads to a new soft and regularized deep K-means algorithm, which yields competitive performance on several image clustering benchmarks.
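
To make the notion of a "soft" K-means concrete, the following is a minimal sketch of generic soft K-means on raw feature vectors, where each point receives a softmax posterior over clusters instead of a hard label. It is not the authors' SR-K-means: the deep network, the $L_2$ regularization term, and the ADM derivation are not reproduced here. The function names and the `temperature` parameter are assumptions introduced for illustration.

```python
import math
import random


def soft_assignments(points, centers, temperature):
    """Return one softmax row per point over negative squared distances."""
    rows = []
    for x in points:
        d2 = [sum((xi - ci) ** 2 for xi, ci in zip(x, c)) for c in centers]
        m = min(d2)  # subtract the minimum for numerical stability
        w = [math.exp(-(d - m) / temperature) for d in d2]
        total = sum(w)
        rows.append([wi / total for wi in w])
    return rows


def update_centers(points, probs, old_centers):
    """Move each center to the posterior-weighted mean of the points."""
    dim = len(points[0])
    new_centers = []
    for k, old in enumerate(old_centers):
        mass = sum(p[k] for p in probs)
        if mass < 1e-12:
            # Cluster received no weight: keep its previous center.
            new_centers.append(list(old))
            continue
        new_centers.append(
            [sum(p[k] * x[j] for p, x in zip(probs, points)) / mass
             for j in range(dim)]
        )
    return new_centers


def soft_kmeans(points, k, temperature=1.0, iters=50, seed=0):
    """Alternate soft assignments and weighted-mean center updates."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        probs = soft_assignments(points, centers, temperature)
        centers = update_centers(points, probs, centers)
    return centers, soft_assignments(points, centers, temperature)


points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centers, probs = soft_kmeans(points, k=2)
labels = [max(range(2), key=lambda j: row[j]) for row in probs]
```

As `temperature` approaches zero, the softmax rows approach one-hot vectors and the updates reduce to the standard hard K-means iteration; larger values spread each point's weight across several clusters.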

Code Repositories

MOhammedJAbi/SoftKMeans (official implementation, PyTorch)

Benchmarks

| Benchmark | Methodology | Metrics |
|---|---|---|
| image-clustering-on-cmu-pie | SR-K-means | Accuracy: 0.902, NMI: 0.945 |
| image-clustering-on-frgc | SR-K-means | Accuracy: 0.413, NMI: 0.487 |
| image-clustering-on-mnist-full | SR-K-means | NMI: 0.913 |
| image-clustering-on-mnist-test | SR-K-means | Accuracy: 0.863, NMI: 0.873 |
| image-clustering-on-usps | SR-K-means | Accuracy: 0.974, NMI: 0.936 |
| image-clustering-on-youtube-faces-db | SR-K-means | Accuracy: 0.605, NMI: 0.806 |
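
For readers unfamiliar with the NMI metric reported above, here is a small sketch of normalized mutual information between a predicted clustering and ground-truth labels. It assumes normalization by the arithmetic mean of the two entropies; the page does not state which normalization the benchmarks use, so this is an illustration of the metric rather than a reproduction of the evaluation protocol. The function names are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in nats) of a label sequence."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information (in nats) between two label sequences."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    joint = Counter(zip(a, b))
    return sum(
        (n_ab / n) * math.log(n_ab * n / (count_a[x] * count_b[y]))
        for (x, y), n_ab in joint.items()
    )


def nmi(a, b):
    """NMI with arithmetic-mean normalization (an assumption, see above)."""
    h_a, h_b = entropy(a), entropy(b)
    if h_a == 0.0 and h_b == 0.0:
        return 1.0  # both sequences are a single cluster
    return 2.0 * mutual_information(a, b) / (h_a + h_b)


same_partition = nmi([0, 0, 1, 1], [1, 1, 0, 0])   # cluster ids permuted
independent = nmi([0, 0, 1, 1], [0, 1, 0, 1])      # unrelated partitions
```

NMI is invariant to how cluster ids are numbered, which is why it can be computed directly on unsupervised outputs. Clustering accuracy, by contrast, first requires finding the best one-to-one matching between cluster ids and class labels.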
