--- a
+++ b/README.md
@@ -0,0 +1,20 @@
+<div class="sc-jegwdG lhLRCf"><div class="sc-UEtKG dGqiYy sc-flttKd cguEtd"><div class="sc-fqwslf gsqkEc"><div class="sc-cBQMlg kAHhUk"><h2 class="sc-dcKlJK sc-cVttbi gqEuPW ksnHgj">About Dataset</h2></div></div></div><div class="sc-davvxH eCVTlP"><div class="sc-jCNfQM ikRdXB"><div style="min-height: 80px;"><div class="sc-etVRix jqYJaa sc-gVIFzB gQKGyV"><p>The dataset was inspired by <a rel="noreferrer nofollow" aria-label="O. Gozes, H. Greenspan, Bone structures extraction and enchantment in chest radiographs via CNN trained on synthetic data, 2020. (opens in a new tab)" target="_blank" href="https://arxiv.org/pdf/2003.10839.pdf">O. Gozes, H. Greenspan, Bone structures extraction and enchantment in chest radiographs via CNN trained on synthetic data, 2020.</a></p>
+<p>The main idea is to convert 3D chest CT scans to a 2D chest X-rays, while extracting some additional information (i.e. bone structure) from the 3D image. </p>
+<p>5 CT scans datasets from <a rel="noreferrer nofollow" aria-label="https://www.cancerimagingarchive.net/collections/ (opens in a new tab)" target="_blank" href="https://www.cancerimagingarchive.net/collections/">https://www.cancerimagingarchive.net/collections/</a> were gathered:</p>
+<ul>
+<li>LIDC-IDRI</li>
+<li>CPTAC-LSCC</li>
+<li>CPTAC-LUAD</li>
+<li>TCGA-LUSC</li>
+<li>TCGA-LUAD</li>
+</ul>
+<p>CT scans with slice thickness of 1.25 or lower were selected. The threshold was set to ensure higher quality of the DRR images in the dataset.</p>
+<p>Each CT scan was digitally reconstructed onto 2D plane with proposed method in the paper - same parameters were used. The bone layer of the CT scan was extracted by using HU values from [300,700] range - the bone layer itself was also digitally reconstructed with the same method.</p>
+<p>Moreover, manual inspection was done to exclude bad quality DRR images. Bad quality was defined as:</p>
+<ul>
+<li>Noise artifacts of a CT device - mostly 'white stripe' appearance in upper and lower parts of DRR;</li>
+<li>CT was done with contrast - heart and vascular structures were visible in the selected HU range;</li>
+<li>Prominent features of not bone related material;</li>
+</ul>
+<p>In the end, roughly 10% of CT scans downloaded met the criteria described above. </p>
+<p>Each image was resized and saved in 512x512 format.</p></div></div></div>
\ No newline at end of file