This repository contains a Jupyter Notebook that implements three different machine learning models to create patient risk profiles using healthcare and clinical datasets. This is only a sample model. The models included are:
The script expects a healthcare dataset in CSV format. The dataset should include a Risk
column as the target variable (0: Low Risk, 1: High Risk) and a PatientID
column, which will be dropped during processing. All other numerical features will be used for training the models.
Ensure you have the following dependencies installed before running the notebook:
pip install pandas numpy scikit-learn xgboost
git clone https://github.com/rkumar1010/patient-risk-profiling.git
cd patient-risk-profiling
Place your dataset in the project directory and update the healthcare_data.csv
filename in the notebook if necessary.
Run the Jupyter Notebook:
jupyter notebook patient_risk_models.ipynb
The notebook compares model performance based on accuracy and classification metrics. The best-performing model can be selected for further deployment.
Feel free to fork this repository and submit pull requests for improvements, additional models, or dataset enhancements.
This project is licensed under the MIT License.
For any questions or suggestions, please open an issue in the repository or contact the maintainers.