Command Palette
Search for a command to run...
Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study
Gregory Holste Song Wang Ziyu Jiang Thomas C. Shen George Shih Ronald M. Summers Yifan Peng Zhangyang Wang

Abstract
Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a "long-tailed" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| long-tail-learning-on-mimic-cxr-lt | MixUp | Balanced Accuracy: 0.176 |
| long-tail-learning-on-mimic-cxr-lt | Reweighted Focal Loss | Balanced Accuracy: 0.239 |
| long-tail-learning-on-mimic-cxr-lt | Class-balanced LDAM-DRW | Balanced Accuracy: 0.267 |
| long-tail-learning-on-mimic-cxr-lt | Class-balanced Softmax | Balanced Accuracy: 0.227 |
| long-tail-learning-on-mimic-cxr-lt | Reweighted Softmax | Balanced Accuracy: 0.211 |
| long-tail-learning-on-mimic-cxr-lt | Softmax | Balanced Accuracy: 0.169 |
| long-tail-learning-on-mimic-cxr-lt | Decoupling (tau-norm) | Balanced Accuracy: 0.230 |
| long-tail-learning-on-mimic-cxr-lt | Class-balanced Focal Loss | Balanced Accuracy: 0.191 |
| long-tail-learning-on-mimic-cxr-lt | Class-balanced LDAM | Balanced Accuracy: 0.225 |
| long-tail-learning-on-mimic-cxr-lt | Reweighted LDAM | Balanced Accuracy: 0.243 |
| long-tail-learning-on-mimic-cxr-lt | Balanced-MixUp | Balanced Accuracy: 0.168 |
| long-tail-learning-on-mimic-cxr-lt | LDAM | Balanced Accuracy: 0.165 |
| long-tail-learning-on-mimic-cxr-lt | Focal Loss | Balanced Accuracy: 0.172 |
| long-tail-learning-on-mimic-cxr-lt | Reweighted LDAM-DRW | Balanced Accuracy: 0.275 |
| long-tail-learning-on-mimic-cxr-lt | Decoupling (cRT) | Balanced Accuracy: 0.296 |
| long-tail-learning-on-nih-cxr-lt | Reweighted Focal Loss | Balanced Accuracy: 0.197 |
| long-tail-learning-on-nih-cxr-lt | Focal Loss | Balanced Accuracy: 0.122 |
| long-tail-learning-on-nih-cxr-lt | Balanced-MixUp | Balanced Accuracy: 0.155 |
| long-tail-learning-on-nih-cxr-lt | Softmax | Balanced Accuracy: 0.115 |
| long-tail-learning-on-nih-cxr-lt | Reweighted LDAM | Balanced Accuracy: 0.279 |
| long-tail-learning-on-nih-cxr-lt | LDAM | Balanced Accuracy: 0.178 |
| long-tail-learning-on-nih-cxr-lt | Class-balanced LDAM | Balanced Accuracy: 0.235 |
| long-tail-learning-on-nih-cxr-lt | Decoupling (cRT) | Balanced Accuracy: 0.294 |
| long-tail-learning-on-nih-cxr-lt | Reweighted LDAM-DRW | Balanced Accuracy: 0.289 |
| long-tail-learning-on-nih-cxr-lt | Decoupling (tau-norm) | Balanced Accuracy: 0.214 |
| long-tail-learning-on-nih-cxr-lt | Class-Balanced Focal Loss | Balanced Accuracy: 0.232 |
| long-tail-learning-on-nih-cxr-lt | MixUp | Balanced Accuracy: 0.118 |
| long-tail-learning-on-nih-cxr-lt | Class-balanced LDAM-DRW | Balanced Accuracy: 0.281 |
| long-tail-learning-on-nih-cxr-lt | Reweighted Softmax | Balanced Accuracy: 0.260 |
| long-tail-learning-on-nih-cxr-lt | Class-Balanced Softmax | Balanced Accuracy: 0.269 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.