HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on High-resolution ICU Data

Hugo Yèche; Rita Kuznetsova; Marc Zimmermann; Matthias Hüser; Xinrui Lyu; Martin Faltys; Gunnar Rätsch

HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on High-resolution ICU Data

Abstract

The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for developing and comparing such methods. While raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet, the choice of tasks and pre-processing is often chosen ad-hoc for each publication, limiting comparability across publications. In this work, we aim to improve this situation by providing a benchmark covering a large spectrum of ICU-related tasks. Using the HiRID dataset, we define multiple clinically relevant tasks in collaboration with clinicians. In addition, we provide a reproducible end-to-end pipeline to construct both data and labels. Finally, we provide an in-depth analysis of current state-of-the-art sequence modeling methods, highlighting some limitations of deep learning approaches for this type of data. With this benchmark, we hope to give the research community the possibility of a fair comparison of their work.

Code Repositories

ratschlab/HIRID-ICU-Benchmark
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
circulatory-failure-on-hiridLSTM
AUPRC: 0.32.2±0.008
circulatory-failure-on-hiridLGBM
AUPRC: 0.389±0.003
circulatory-failure-on-hiridLGBM ( + hand crafted features)
AUPRC: 0.388±0.002
circulatory-failure-on-hiridTCN
AUPRC: 0.35.8±0.006
circulatory-failure-on-hiridGRU
AUPRC: 0.368±0.005
circulatory-failure-on-hiridTransformer
AUPRC: 0.352±0.006
circulatory-failure-on-hiridLR
AUPRC: 0.305±0.000
icu-mortality-on-hiridLogistic Regression
AUPRC: 0.581±0.000
icu-mortality-on-hiridTransformer
AUPRC: 0.610±0.008
icu-mortality-on-hiridGRU
AUPRC: 0.603 ±0.016
icu-mortality-on-hiridLSTM
AUPRC: 0.600±0.009
icu-mortality-on-hiridLGBM
AUPRC: 0.546±0.008
icu-mortality-on-hiridLGBM ( + hand crafted features)
AUPRC: 0.626±0.000
icu-mortality-on-hiridTCN
AUPRC: 0.602±0.011
kidney-function-on-hiridLSTM
MAE: 0.50±0.01
kidney-function-on-hiridGRU
MAE: 0.49±0.02
kidney-function-on-hiridLGBM ( + hand crafted features)
MAE: 0.45±0.00
kidney-function-on-hiridTransformer
MAE: 0.48±0.02
kidney-function-on-hiridTCN
MAE: 0.50±0.01
kidney-function-on-hiridLGBM
MAE: 0.45±0.00
patient-phenotyping-on-hiridTCN
Balanced Accuracy: 41.6±2.3
patient-phenotyping-on-hiridLGBM
Balanced Accuracy: 40.4±0.8
patient-phenotyping-on-hiridLGBM ( + hand crafted features)
Balanced Accuracy: 45.8±2.0
patient-phenotyping-on-hiridTransformer
Balanced Accuracy: 42.7±1.4
patient-phenotyping-on-hiridGRU
Balanced Accuracy: 39.2±2.1
patient-phenotyping-on-hiridLogistic Regression
Balanced Accuracy: 39.1±0.0
patient-phenotyping-on-hiridLSTM
Balanced Accuracy: 39.5±1.2
remaining-length-of-stay-on-hiridLGBM ( + hand crafted features)
MAE: 57.0±0.3
remaining-length-of-stay-on-hiridLGBM
MAE: 56.9±0.4
remaining-length-of-stay-on-hiridTransformer
MAE: 59.5±2.8
remaining-length-of-stay-on-hiridTCN
MAE: 59.8±2.8
remaining-length-of-stay-on-hiridLSTM
MAE: 60.7±1.6
remaining-length-of-stay-on-hiridGRU
MAE: 60.6±0.9
respiratory-failure-on-hiridLSTM
AUPRC: 0.569±0.003
respiratory-failure-on-hiridLGBM ( + hand crafted features)
AUPRC: 0.604±0.002
respiratory-failure-on-hiridTCN
AUPRC: 0.589±0.003
respiratory-failure-on-hiridGRU
AUPRC: 0.592±0.003
respiratory-failure-on-hiridLogistic Regression
AUPRC: 0.530±0.000
respiratory-failure-on-hiridLGBM
AUPRC: 0.585±0.001
respiratory-failure-on-hiridTransformer
AUPRC: 0.594±0.003

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp