HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Effective Convolutional Attention Network for Multi-label Clinical Document Classification

{Thomas Schaaf Matthew R. Gormley Russell Klopfer Hua Cheng Yang Liu}

Effective Convolutional Attention Network for Multi-label Clinical Document Classification

Abstract

Multi-label document classification (MLDC) problems can be challenging, especially for long documents with a large label set and a long-tail distribution over labels. In this paper, we present an effective convolutional attention network for the MLDC problem with a focus on medical code prediction from clinical documents. Our innovations are three-fold: (1) we utilize a deep convolution-based encoder with the squeeze-and-excitation networks and residual networks to aggregate the information across the document and learn meaningful document representations that cover different ranges of texts; (2) we explore multi-layer and sum-pooling attention to extract the most informative features from these multi-scale representations; (3) we combine binary cross entropy loss and focal loss to improve performance for rare labels. We focus our evaluation study on MIMIC-III, a widely used dataset in the medical domain. Our models outperform prior work on medical coding and achieve new state-of-the-art results on multiple metrics. We also demonstrate the language independent nature of our approach by applying it to two non-English datasets. Our model outperforms prior best model and a multilingual Transformer model by a substantial margin.

Benchmarks

BenchmarkMethodologyMetrics
medical-code-prediction-on-mimic-iiiEffectiveCAN
Macro-AUC: 91.5
Macro-F1: 10.6
Micro-AUC: 98.8
Micro-F1: 58.9
Precision@15: 60.6
Precision@8: 75.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Effective Convolutional Attention Network for Multi-label Clinical Document Classification | Papers | HyperAI