--- a
+++ b/kaggle_3m/README.md
@@ -0,0 +1,22 @@
+# LGG Segmentation Dataset
+
+This dataset contains brain MR images together with manual FLAIR abnormality segmentation masks.
+The images were obtained from The Cancer Imaging Archive (TCIA).
+They correspond to 110 patients included in The Cancer Genome Atlas (TCGA) lower-grade glioma collection with at least a fluid-attenuated inversion recovery (FLAIR) sequence and genomic cluster data available.
+Tumor genomic clusters and patient data are provided in the `data.csv` file.
+
+
+All images are provided in `.tif` format with 3 channels per image.
+For 101 cases, 3 sequences are available, i.e. pre-contrast, FLAIR, post-contrast (in this order of channels).
+For 9 cases, the post-contrast sequence is missing, and for 6 cases, the pre-contrast sequence is missing.
+Missing sequences are replaced with the FLAIR sequence to make all images 3-channel.
+Masks are binary, 1-channel images.
+They segment the FLAIR abnormality present in the FLAIR sequence (available for all cases).
+
+
+The dataset is organized into 110 folders, each named after a case ID that contains information about the source institution.
+Each folder contains MR images with the following naming convention:
+
+`TCGA_<institution-code>_<patient-id>_<slice-number>.tif`
+
+Corresponding masks have a `_mask` suffix.
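
The naming convention above can be turned into a small helper for pairing each image with its mask. The following is a minimal sketch, not part of the dataset: the names `SliceName`, `parse_slice_name`, `mask_path_for`, and `list_pairs` are hypothetical, and the code assumes that the `_mask` suffix sits directly before the `.tif` extension. As a precaution it also tolerates extra underscore-separated tokens between the patient ID and the slice number; that tolerance is an assumption, not something the README states.

```python
from pathlib import Path
from typing import List, NamedTuple, Tuple


class SliceName(NamedTuple):
    """Parts of a slice file name (hypothetical helper type)."""
    institution: str
    patient_id: str
    slice_number: int
    is_mask: bool


def parse_slice_name(filename: str) -> SliceName:
    """Split TCGA_<institution-code>_<patient-id>_<slice-number>[_mask].tif."""
    path = Path(filename)
    if path.suffix.lower() != ".tif":
        raise ValueError(f"not a .tif file: {filename}")
    tokens = path.stem.split("_")
    # Assumption: masks end in "_mask" right before the extension.
    is_mask = tokens[-1] == "mask"
    if is_mask:
        tokens = tokens[:-1]
    # The slice number is read from the last token, so any extra tokens
    # between the patient ID and the slice number are ignored.
    if len(tokens) < 4 or tokens[0] != "TCGA" or not tokens[-1].isdecimal():
        raise ValueError(f"unexpected file name: {filename}")
    return SliceName(tokens[1], tokens[2], int(tokens[-1]), is_mask)


def mask_path_for(image_path: Path) -> Path:
    """Return the path where the mask for image_path is expected."""
    return image_path.with_name(f"{image_path.stem}_mask{image_path.suffix}")


def list_pairs(case_dir: Path) -> List[Tuple[Path, Path]]:
    """Collect (image, mask) pairs in one case folder, ordered by slice number."""
    pairs = []
    for path in case_dir.glob("*.tif"):
        if parse_slice_name(path.name).is_mask:
            continue  # masks are reached through their image
        mask = mask_path_for(path)
        if mask.exists():
            pairs.append((path, mask))
    return sorted(pairs, key=lambda p: parse_slice_name(p[0].name).slice_number)
```

Calling `list_pairs(Path("<case-folder>"))` on one of the 110 case folders would return the image and mask paths sorted by slice number. Sorting on the parsed integer rather than on the file name avoids slice 10 being ordered before slice 2.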