HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition

Saptarshi Sinha Hiroki Ohashi

Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition

Abstract

Long-tailed datasets, where head classes comprise much more training samples than tail classes, cause recognition models to get biased towards the head classes. Weighted loss is one of the most popular ways of mitigating this issue, and a recent work has suggested that class-difficulty might be a better clue than conventionally used class-frequency to decide the distribution of weights. A heuristic formulation was used in the previous work for quantifying the difficulty, but we empirically find that the optimal formulation varies depending on the characteristics of datasets. Therefore, we propose Difficulty-Net, which learns to predict the difficulty of classes using the model's performance in a meta-learning framework. To make it learn reasonable difficulty of a class within the context of other classes, we newly introduce two key concepts, namely the relative difficulty and the driver loss. The former helps Difficulty-Net take other classes into account when calculating difficulty of a class, while the latter is indispensable for guiding the learning to a meaningful direction. Extensive experiments on popular long-tailed datasets demonstrated the effectiveness of the proposed method, and it achieved state-of-the-art performance on multiple long-tailed datasets.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
long-tail-learning-on-cifar-100-lt-r-10Difficulty-Net
Error Rate: 34.78
long-tail-learning-on-cifar-100-lt-r-100Difficulty-Net
Error Rate: 47.04
long-tail-learning-on-cifar-100-lt-r-50Difficulty-Net
Error Rate: 43.1
long-tail-learning-on-imagenet-ltDifficulty-Net (ResNet-50 w/o using RandAugment, single model)
Top-1 Accuracy: 54.0
long-tail-learning-on-imagenet-ltDifficulty-Net (ResNet-10 w/o using RandAugment, single model
Top-1 Accuracy: 44.6
long-tail-learning-on-imagenet-ltDifficulty-Net (ResNet-50 using RandAugment, single model)
Top-1 Accuracy: 57.4
long-tail-learning-on-places-ltDifficulty-Net (ResNet-152)
Top-1 Accuracy: 41.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp