--- a
+++ b/kaggle_3m/README.md
@@ -0,0 +1,22 @@
+# LGG Segmentation Dataset
+
+This dataset contains brain MR images together with manual FLAIR abnormality segmentation masks.
+The images were obtained from The Cancer Imaging Archive (TCIA).
+They correspond to 110 patients included in The Cancer Genome Atlas (TCGA) lower-grade glioma collection with at least fluid-attenuated inversion recovery (FLAIR) sequence and genomic cluster data available.
+Tumor genomic clusters and patient data is provided in `data.csv` file.
+
+
+All images are provided in `.tif` format with 3 channels per image.
+For 101 cases, 3 sequences are available, i.e. pre-contrast, FLAIR, post-contrast (in this order of channels).
+For 9 cases, post-contrast sequence is missing and for 6 cases, pre-contrast sequence is missing.
+Missing sequences are replaced with FLAIR sequence to make all images 3-channel.
+Masks are binary, 1-channel images.
+They segment FLAIR abnormality present in the FLAIR sequence (available for all cases).
+
+
+The dataset is organized into 110 folders named after case ID that contains information about source institution.
+Each folder contains MR images with the following naming convention:
+
+`TCGA_<institution-code>_<patient-id>_<slice-number>.tif`
+
+Corresponding masks have a `_mask` suffix.