--- a
+++ b/README.md
@@ -0,0 +1,23 @@
+# Lung Cancer Prediction ML Project
+A machine learning project that uses synthetic patient data to predict lung cancer risk.
+<br> <br>
+*Authors: [Krzysztof Adamczyk](https://github.com/KAdamczykk), [Paweł Florek](https://github.com/FlorekPawel), [Julia Kruk](https://github.com/krukj) & [Nadia Serafin](https://github.com/nadias03)*
+
+## Data source
+[Synthea dataset | Harvard Dataverse](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GD5XWE)
+## Used technologies
+- programming language: **Python**
+- libraries:
+  - numpy
+  - pandas
+  - seaborn, matplotlib
+  - sklearn
+
+## Project structure
+```
+ML-Synthea-lung-cancer-prediction/
+ ├── data/               # Raw data
+ ├── notebooks/          # Jupyter Notebooks
+ ├── validation_data/    # Data for validation
+ └── README.md           # This file
+```