MV-MR: multi-views and multi-representations for self-supervised learning and knowledge distillation
The recent success of machine learning methods applied to time series
collected from Intensive Care Units (ICU) exposes the lack of standardized
machine learning benchmarks for developing and comparing such methods. While
raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet,
the choice of tasks and pre-processing is often chosen ad-hoc for each
publication, limiting comparability across publications. In this work, we aim
to improve this situation by providing a benchmark covering a large spectrum of
ICU-related tasks. Using the HiRID dataset, we define multiple clinically
relevant tasks in collaboration with clinicians. In addition, we provide a
reproducible end-to-end pipeline to construct both data and labels. Finally, we
provide an in-depth analysis of current state-of-the-art sequence modeling
methods, highlighting some limitations of deep learning approaches for this
type of data. With this benchmark, we hope to give the research community the
possibility of a fair comparison of their work.