HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Graph InfoClust: Leveraging cluster-level node information for unsupervised graph representation learning

Costas Mavromatis George Karypis

Graph InfoClust: Leveraging cluster-level node information for unsupervised graph representation learning

Abstract

Unsupervised (or self-supervised) graph representation learning is essential to facilitate various graph data mining tasks when external supervision is unavailable. The challenge is to encode the information about the graph structure and the attributes associated with the nodes and edges into a low dimensional space. Most existing unsupervised methods promote similar representations across nodes that are topologically close. Recently, it was shown that leveraging additional graph-level information, e.g., information that is shared among all nodes, encourages the representations to be mindful of the global properties of the graph, which greatly improves their quality. However, in most graphs, there is significantly more structure that can be captured, e.g., nodes tend to belong to (multiple) clusters that represent structurally similar nodes. Motivated by this observation, we propose a graph representation learning method called Graph InfoClust (GIC), that seeks to additionally capture cluster-level information content. These clusters are computed by a differentiable K-means method and are jointly optimized by maximizing the mutual information between nodes of the same clusters. This optimization leads the node representations to capture richer information and nodal interactions, which improves their quality. Experiments show that GIC outperforms state-of-art methods in various downstream tasks (node classification, link prediction, and node clustering) with a 0.9% to 6.1% gain over the best competing approach, on average.

Code Repositories

cmavro/Graph-InfoClust-GIC
pytorch
Mentioned in GitHub
cmavro/HeMI
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
link-prediction-on-citeseerGraph InfoClust (GIC)
AP: 96.8
AUC: 97
link-prediction-on-corasGraphite-VAE
AP: 93.5%
AUC: 93.7%
link-prediction-on-pubmedGraph InfoClust (GIC)
AP: 93.5%
AUC: 93.7%
node-classification-on-amz-compGraph InfoClust (GIC)
Accuracy: 81.5 ± 1.0
node-classification-on-amz-photoGraph InfoClust (GIC)
Accuracy: 90.4 ± 1.0
node-classification-on-citeseerGraph InfoClust (GIC)
Accuracy: 71.9 ± 1.4
node-classification-on-coauthor-csGraph InfoClust (GIC)
Accuracy: 89.4 ± 0.4
node-classification-on-coauthor-phyGraph InfoClust (GIC)
Accuracy: 93.1 ± 0.7
node-classification-on-cora-fixed-20-node-perGraph InfoClust (GIC)
Accuracy: 81.7 ± 1.5
node-classification-on-pubmedGraph InfoClust (GIC)
Accuracy: 77.4 ± 1.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp