|
a/README.md |
|
b/README.md |
1 |
# Using Machine Learning to Analysis of Blood Donation |
1 |
# Using Machine Learning to Analysis of Blood Donation
|
2 |
## Which features impact high blood donation? |
2 |
## Which features impact high blood donation?
|
3 |
 |
3 |

|
4 |
* Blood donation is the only method to secure accessibility of blood products for patients in need of transfusion and accessibility is solely dependent on voluntary non-remunerated blood donors. |
4 |
* Blood donation is the only method to secure accessibility of blood products for patients in need of transfusion and accessibility is solely dependent on voluntary non-remunerated blood donors.
|
5 |
* Many countries' health authorities and blood centers are striving to have sufficient blood donations to secure a stable blood supply in the country or region. |
5 |
* Many countries' health authorities and blood centers are striving to have sufficient blood donations to secure a stable blood supply in the country or region.
|
6 |
* In this case study, we aimed to find features that could help improve blood donation performance in South Korea. |
6 |
* In this case study, we aimed to find features that could help improve blood donation performance in South Korea.
|
7 |
* The objective of this case study is as follows: |
7 |
* The objective of this case study is as follows:
|
8 |
* Analyze the trend of blood donations in South Korea since 2005 |
8 |
* Analyze the trend of blood donations in South Korea since 2005
|
9 |
* Identify which features have a significant impact on high blood donation |
9 |
* Identify which features have a significant impact on high blood donation
|
10 |
* Recommendations for blood centers to increase blood donation |
10 |
* Recommendations for blood centers to increase blood donation
|
11 |
* To identify important features, we built various machine-learning models from simple Logistic Regression to the XGBoosting Classifier model. |
11 |
* To identify important features, we built various machine-learning models from simple Logistic Regression to the XGBoosting Classifier model.
|
12 |
* Platforms used for this project |
12 |
* Platforms used for this project
|
13 |
* Python: Pandas, Scikit-learn, Xgboost |
13 |
* Python: Pandas, Scikit-learn, Xgboost
|
14 |
* Visualization: Tableau, Seaborn |
14 |
* Visualization: Tableau, Seaborn
|
15 |
## Project Outline |
15 |
## Project Outline
|
16 |
1. Plan Stage: Data collection, Data dictionary, Pre-processing & Feature engineering |
16 |
1. Plan Stage: Data collection, Data dictionary, Pre-processing & Feature engineering
|
17 |
2. Analyze Stage: Exploratory Data Analysis (EDA) |
17 |
2. Analyze Stage: Exploratory Data Analysis (EDA)
|
18 |
3. Construct Stage: Prepare, Construct, and train & test models |
18 |
3. Construct Stage: Prepare, Construct, and train & test models
|
19 |
4. Execute Stage: Interpret model performance, conclusion (recommendation) |
19 |
4. Execute Stage: Interpret model performance, conclusion (recommendation)
|
20 |
## Note for this project |
20 |
## Note for this project
|
21 |
* The data used for this project is entirely from public open data, extracted from KOSIS (Korean Statistical Information Service). No internal data was included. |
21 |
* The data used for this project is entirely from public open data, extracted from KOSIS (Korean Statistical Information Service). No internal data was included.
|
22 |
* This project was worked with <ins>**Yoojin (Audrey) Jung**</ins>, a data analyst, with a master's degree from the University of Illinois Urbana-Champaign (UIUC). |
22 |
* This project was worked with <ins>**Yoojin (Audrey) Jung**</ins>, a data analyst, with a master's degree from the University of Illinois Urbana-Champaign (UIUC).
|
23 |
* **(Skills)** Data Analytics, Visualization (Tableau), SQL, R, Python |
23 |
* **(Skills)** Data Analytics, Visualization (Tableau), SQL, R, Python
|
24 |
* **(Contributions to this project)** EDA (Processing & Visualization), Tableau, model design (logistic regression, decision tree) |
24 |
* **(Contributions to this project)** EDA (Processing & Visualization), Tableau, model design (logistic regression, decision tree)
|
25 |
* **(LinkedIn Profile)** https://www.linkedin.com/in/yoojin-jung/ |
25 |
* **(LinkedIn Profile)** https://www.linkedin.com/in/yoojin-jung/
|
26 |
## Citation |
26 |
## Citation
|
27 |
If you use (or refer to) this project in your research (study), please cite it as follows: |
27 |
If you use (or refer to) this project in your research (study), please cite it as follows:
|
28 |
* Author: Hyunjin and Yoojin |
28 |
* Author: Hyunjin and Yoojin
|
29 |
* Title: Using Machine Learning to Analysis of Blood Donation |
29 |
* Title: Using Machine Learning to Analysis of Blood Donation
|
30 |
* Year: 2024 |
30 |
* Year: 2024
|
31 |
* Publisher: Hyunjin-Austin |
31 |
* Publisher: Hyunjin-Austin
|
32 |
* Journal: Blood-Donation-Analysis |
32 |
* Journal: Blood-Donation-Analysis
|
33 |
* How published: https://github.com/Hyunjin-Austin/Blood-Donation-Analysis/ <br><br> |
33 |
* How published: https://github.com/Hyunjin-Austin/Blood-Donation-Analysis/ <br><br>
|
34 |
|
34 |
|
35 |
Thank you for citing our work! |
35 |
Thank you for citing our work!
|