--- a
+++ b/README.md
@@ -0,0 +1,23 @@
+# Lung Cancer Predition ML Project
+A Machine Learning project leveraging synthetic patient data to predict lung cancer risk.
+<br> <br>
+*Authors: [Krzysztof Adamczyk](https://github.com/KAdamczykk), [Paweł Florek](https://github.com/FlorekPawel), [Julia Kruk](https://github.com/krukj) & [Nadia Serafin](https://github.com/nadias03)*
+
+ ## Data source
+[Synthea dataset | Harvard Dataverse](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GD5XWE)
+ ## Used technologies 
+- programming language: **Python**
+- used technologies and libraries:
+  - numpy
+  - pandas
+  - seaborn, matplotlib
+  - sklearn
+
+## Project structure
+```
+ML-Synthea-lung-cancer-prediction/
+  ├── data/                       # Raw data
+  ├── notebooks/                  # Jupyter Notebooks    
+  ├── validation_data/            # Data for validation
+  └── README.md                   # This file
+```