HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition

Gao Bin-Bin ; Zhou Hong-Yu

Learning to Discover Multi-Class Attentional Regions for Multi-Label
  Image Recognition

Abstract

Multi-label image recognition is a practical and challenging task compared tosingle-label image classification. However, previous works may be suboptimalbecause of a great number of object proposals or complex attentional regiongeneration modules. In this paper, we propose a simple but efficient two-streamframework to recognize multi-category objects from global image to localregions, similar to how human beings perceive objects. To bridge the gapbetween global and local streams, we propose a multi-class attentional regionmodule which aims to make the number of attentional regions as small aspossible and keep the diversity of these regions as high as possible. Ourmethod can efficiently and effectively recognize multi-class objects with anaffordable computation cost and a parameter-free region localization module.Over three benchmarks on multi-label image classification, we create newstate-of-the-art results with a single model only using image semantics withoutlabel dependency. In addition, the effectiveness of the proposed method isextensively demonstrated under different factors such as global poolingstrategy, input size and network architecture. Code has been made availableat~\url{https://github.com/gaobb/MCAR}.

Code Repositories

gaobb/MCAR
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
multi-label-classification-on-ms-cocoMCAR (ResNet101, 448x448)
mAP: 83.8
multi-label-classification-on-ms-cocoMCAR (ResNet101, 576x576)
mAP: 84.5
multi-label-classification-on-pascal-voc-2007MCAR (ResNet101, 448x448)
mAP: 94.8
multi-label-classification-on-pascal-voc-2012MCAR (ResNet101, 448x448)
mAP: 94.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp