[d6904d]: / datasets / README.md

Download this file

48 lines (35 with data), 1.4 kB

Dataset

Supported datasets

  • Tongji Hospital (aka. TJH dataset)
  • HM Hospital (aka. CDSL dataset)

Folder structure

datasets/
    tongji/
        preprocess.py
        raw_data/
            ...
        processed_data/
            fold_{i=0,1,2,...}/
                train/
                    x.pkl
                    y.pkl
                    statistics.csv
                val/
                    x.pkl
                    y.pkl
                test/
                    x.pkl
                    y.pkl
    hm/
        preprocess.py
        ...

We also provide a Jupyter format pre-process script (preprocess.ipynb)

TJH Dataset

Refer: An interpretable mortality prediction model for COVID-19 patients

How to Access Data

Download Link: Supplementary Data 1 Training and external test datasets (refer supplementary information of the paper above)

Or you can download raw data (Excel format) and processed data (Python PKL format) in GitHub Releases Assets.

CDSL Dataset

You need to apply for the CDSL dataset if you need. Link: Covid Data Save Lives Dataset