b/kaggle_3m/README.md

# LGG Segmentation Dataset

This dataset contains brain MR images together with manual FLAIR abnormality segmentation masks.
The images were obtained from The Cancer Imaging Archive (TCIA).
They correspond to 110 patients included in The Cancer Genome Atlas (TCGA) lower-grade glioma collection with at least a fluid-attenuated inversion recovery (FLAIR) sequence and genomic cluster data available.
Tumor genomic clusters and patient data are provided in the `data.csv` file.

All images are provided in `.tif` format with 3 channels per image.
For 101 cases, 3 sequences are available: pre-contrast, FLAIR, and post-contrast (in this channel order).
For 9 cases, the post-contrast sequence is missing, and for 6 cases, the pre-contrast sequence is missing.
Missing sequences are replaced with the FLAIR sequence so that every image has 3 channels.
Masks are binary, 1-channel images.
They segment the FLAIR abnormality present in the FLAIR sequence (available for all cases).
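
The channel layout described above can be sketched in code. The following is a minimal example, assuming an image has already been read into a NumPy array of shape `(H, W, 3)` with any TIFF reader. The function names are hypothetical and not part of the dataset. Detecting a substituted sequence by exact equality with the FLAIR channel is a heuristic assumption, as is treating any nonzero mask pixel as abnormality.

```python
import numpy as np

# Channel order stated in the README: pre-contrast, FLAIR, post-contrast.
CHANNELS = ("pre_contrast", "flair", "post_contrast")


def split_sequences(image: np.ndarray) -> dict:
    """Split an (H, W, 3) image array into one 2-D array per MR sequence."""
    if image.ndim != 3 or image.shape[-1] != 3:
        raise ValueError(f"expected an (H, W, 3) array, got shape {image.shape}")
    return {name: image[..., i] for i, name in enumerate(CHANNELS)}


def substituted_sequences(image: np.ndarray) -> list:
    """Heuristic (assumption): list non-FLAIR channels that exactly copy FLAIR."""
    seqs = split_sequences(image)
    return [
        name
        for name in ("pre_contrast", "post_contrast")
        if np.array_equal(seqs[name], seqs["flair"])
    ]


def binarize_mask(mask: np.ndarray) -> np.ndarray:
    """Return a boolean mask, treating any nonzero pixel as abnormality (assumption)."""
    return mask > 0
```

Because the equality check works per slice, a completely blank slice would report both channels as substituted, so it is safer to apply the check to a slice with visible tissue or to aggregate over a whole case.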

The dataset is organized into 110 folders, each named after a case ID that contains information about the source institution.
Each folder contains MR images with the following naming convention:

`TCGA_<institution-code>_<patient-id>_<slice-number>.tif`

Corresponding masks have a `_mask` suffix.
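
The naming convention makes it possible to pair each image with its mask by file name alone. The sketch below is one way to do this, assuming the `_mask` suffix is placed directly before the `.tif` extension and that the slice number is the last underscore-separated field of the image name. The helper names are hypothetical.

```python
from pathlib import Path


def is_mask(path: Path) -> bool:
    """True if the file name (without extension) ends with the `_mask` suffix."""
    return path.stem.endswith("_mask")


def mask_path_for(image_path: Path) -> Path:
    """Build the expected mask path: same folder and name, plus `_mask`."""
    return image_path.with_name(f"{image_path.stem}_mask{image_path.suffix}")


def slice_number(image_path: Path) -> int:
    """Slice number: last underscore-separated field of the image file name."""
    return int(image_path.stem.rsplit("_", 1)[-1])


def list_pairs(case_dir: Path) -> list:
    """Return (image, mask) path pairs for one case folder, ordered by slice number.

    Images without a mask file on disk are skipped.
    """
    images = [p for p in case_dir.glob("*.tif") if not is_mask(p)]
    pairs = [(p, mask_path_for(p)) for p in sorted(images, key=slice_number)]
    return [(img, msk) for img, msk in pairs if msk.exists()]
```

Sorting by the parsed integer rather than by file name keeps slice 10 after slice 2, which a plain lexicographic sort would not.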