HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study

Gregory Holste Song Wang Ziyu Jiang Thomas C. Shen George Shih Ronald M. Summers Yifan Peng Zhangyang Wang

Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study

Abstract

Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a "long-tailed" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR.

Code Repositories

vita-group/longtailcxr
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
long-tail-learning-on-mimic-cxr-ltMixUp
Balanced Accuracy: 0.176
long-tail-learning-on-mimic-cxr-ltReweighted Focal Loss
Balanced Accuracy: 0.239
long-tail-learning-on-mimic-cxr-ltClass-balanced LDAM-DRW
Balanced Accuracy: 0.267
long-tail-learning-on-mimic-cxr-ltClass-balanced Softmax
Balanced Accuracy: 0.227
long-tail-learning-on-mimic-cxr-ltReweighted Softmax
Balanced Accuracy: 0.211
long-tail-learning-on-mimic-cxr-ltSoftmax
Balanced Accuracy: 0.169
long-tail-learning-on-mimic-cxr-ltDecoupling (tau-norm)
Balanced Accuracy: 0.230
long-tail-learning-on-mimic-cxr-ltClass-balanced Focal Loss
Balanced Accuracy: 0.191
long-tail-learning-on-mimic-cxr-ltClass-balanced LDAM
Balanced Accuracy: 0.225
long-tail-learning-on-mimic-cxr-ltReweighted LDAM
Balanced Accuracy: 0.243
long-tail-learning-on-mimic-cxr-ltBalanced-MixUp
Balanced Accuracy: 0.168
long-tail-learning-on-mimic-cxr-ltLDAM
Balanced Accuracy: 0.165
long-tail-learning-on-mimic-cxr-ltFocal Loss
Balanced Accuracy: 0.172
long-tail-learning-on-mimic-cxr-ltReweighted LDAM-DRW
Balanced Accuracy: 0.275
long-tail-learning-on-mimic-cxr-ltDecoupling (cRT)
Balanced Accuracy: 0.296
long-tail-learning-on-nih-cxr-ltReweighted Focal Loss
Balanced Accuracy: 0.197
long-tail-learning-on-nih-cxr-ltFocal Loss
Balanced Accuracy: 0.122
long-tail-learning-on-nih-cxr-ltBalanced-MixUp
Balanced Accuracy: 0.155
long-tail-learning-on-nih-cxr-ltSoftmax
Balanced Accuracy: 0.115
long-tail-learning-on-nih-cxr-ltReweighted LDAM
Balanced Accuracy: 0.279
long-tail-learning-on-nih-cxr-ltLDAM
Balanced Accuracy: 0.178
long-tail-learning-on-nih-cxr-ltClass-balanced LDAM
Balanced Accuracy: 0.235
long-tail-learning-on-nih-cxr-ltDecoupling (cRT)
Balanced Accuracy: 0.294
long-tail-learning-on-nih-cxr-ltReweighted LDAM-DRW
Balanced Accuracy: 0.289
long-tail-learning-on-nih-cxr-ltDecoupling (tau-norm)
Balanced Accuracy: 0.214
long-tail-learning-on-nih-cxr-ltClass-Balanced Focal Loss
Balanced Accuracy: 0.232
long-tail-learning-on-nih-cxr-ltMixUp
Balanced Accuracy: 0.118
long-tail-learning-on-nih-cxr-ltClass-balanced LDAM-DRW
Balanced Accuracy: 0.281
long-tail-learning-on-nih-cxr-ltReweighted Softmax
Balanced Accuracy: 0.260
long-tail-learning-on-nih-cxr-ltClass-Balanced Softmax
Balanced Accuracy: 0.269

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp