HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging

Singh Shubhr ; Benetos Emmanouil ; Phan Huy ; Stowell Dan

LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification
  and Tagging

Abstract

Transformers have set new benchmarks in audio processing tasks, leveragingself-attention mechanisms to capture complex patterns and dependencies withinaudio data. However, their focus on pairwise interactions limits their abilityto process the higher-order relations essential for identifying distinct audioobjects. To address this limitation, this work introduces the Local- HigherOrder Graph Neural Network (LHGNN), a graph based model that enhances featureunderstanding by integrating local neighbourhood information with higher-orderdata from Fuzzy C-Means clusters, thereby capturing a broader spectrum of audiorelationships. Evaluation of the model on three publicly available audiodatasets shows that it outperforms Transformer-based models across allbenchmarks while operating with substantially fewer parameters. Moreover, LHGNNdemonstrates a distinct advantage in scenarios lacking ImageNet pretraining,establishing its effectiveness and efficiency in environments where extensivepretraining data is unavailable.

Benchmarks

BenchmarkMethodologyMetrics
audio-classification-on-audio-setLHGNN
Mean AP: 46.6
audio-classification-on-esc-50LHGNN
Top-1 Accuracy: 96.2
audio-classification-on-fsd50kLHGNN
Mean AP: 59

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | Papers | HyperAI