HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Learn from Each Other to Classify Better: Cross-layer Mutual Attention Learning for Fine-grained Visual Classification

{Jien Kato Yu Wang Longjiao Zhao Dichao Liu}

Abstract

Fine-grained visual classification (FGVC) is valuable yet challenging. The difficulty of FGVC mainly lies in its intrinsic inter-class similarity, intra-class variation, and limited training data. Moreover, with the popularity of deep convolutional neural networks, researchers have mainly used deep, abstract, semantic information for FGVC, while shallow, detailed information has been neglected. This work proposes a cross-layer mutual attention learning network (CMAL-Net) to solve the above problems. Specifically, this work views the shallow to deep layers of CNNs as “experts” knowledgeable about different perspectives. We let each expert give a category prediction and an attention region indicating the found clues. Attention regions are treated as information carriers among experts, bringing three benefits: (ⅰ) helping the model focus on discriminative regions; (ⅱ) providing more training data; (ⅲ) allowing experts to learn from each other to improve the overall performance. CMAL-Net achieves state-of-the-art performance on three competitive datasets: FGVC-Aircraft, Stanford Cars, and Food-11.

Benchmarks

BenchmarkMethodologyMetrics
fine-grained-image-classification-on-fgvcCMAL-Net
Accuracy: 94.7%
fine-grained-image-classification-on-stanfordCMAL-Net
Accuracy: 97.1%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learn from Each Other to Classify Better: Cross-layer Mutual Attention Learning for Fine-grained Visual Classification | Papers | HyperAI