HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance

Saptarshi Sinha Hiroki Ohashi Katsuyuki Nakamura

Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance

Abstract

Class-imbalance is one of the major challenges in real world datasets, where a few classes (called majority classes) constitute much more data samples than the rest (called minority classes). Learning deep neural networks using such datasets leads to performances that are typically biased towards the majority classes. Most of the prior works try to solve class-imbalance by assigning more weights to the minority classes in various manners (e.g., data re-sampling, cost-sensitive learning). However, we argue that the number of available training data may not be always a good clue to determine the weighting strategy because some of the minority classes might be sufficiently represented even by a small number of training data. Overweighting samples of such classes can lead to drop in the model's overall performance. We claim that the 'difficulty' of a class as perceived by the model is more important to determine the weighting. In this light, we propose a novel loss function named Class-wise Difficulty-Balanced loss, or CDB loss, which dynamically distributes weights to each sample according to the difficulty of the class that the sample belongs to. Note that the assigned weights dynamically change as the 'difficulty' for the model may change with the learning progress. Extensive experiments are conducted on both image (artificially induced class-imbalanced MNIST, long-tailed CIFAR and ImageNet-LT) and video (EGTEA) datasets. The results show that CDB loss consistently outperforms the recently proposed loss functions on class-imbalanced datasets irrespective of the data type (i.e., video or image).

Code Repositories

hitachi-rd-cv/CDB-loss
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
long-tail-learning-on-cifar-100-lt-r-10CDB-loss
Error Rate: 41.26
long-tail-learning-on-cifar-100-lt-r-100CDB-loss
Error Rate: 57.43
long-tail-learning-on-egteaCDB-loss (3D- ResNeXt101)
Average Precision: 63.86
Average Recall: 66.24
long-tail-learning-on-imagenet-ltCDB-loss (ResNet 10)
Top-1 Accuracy: 38.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp