--- a
+++ b/README.md
@@ -0,0 +1,47 @@
+<h1>About Dataset</h1>
+
+<h2>Similar Datasets</h2>
+<ul>
+  <li><strong>🌟 CERN Electron Collision Data:</strong> <a href="#">LINK</a></li>
+  <li>Hepatitis C Dataset: <a href="#">LINK</a></li>
+  <li>Body Fat Prediction Dataset: <a href="#">LINK</a></li>
+  <li>Cirrhosis Prediction Dataset: <a href="#">LINK</a></li>
+  <li>Heart Failure Prediction Dataset: <a href="#">LINK</a></li>
+  <li>Stellar Classification Dataset – SDSS17: <a href="#">LINK</a></li>
+  <li>Wind Speed Prediction Dataset: <a href="#">LINK</a></li>
+  <li>Spanish Wine Quality Dataset: <a href="#">LINK</a></li>
+</ul>
+
+<h2>Context</h2>
+<p>
+According to the World Health Organization (WHO), <strong>stroke is the second leading cause of death globally</strong>, accounting for approximately <strong>11% of total deaths</strong>.
+</p>
+
+<p>
+This dataset is designed to help predict whether a patient is likely to have a stroke, based on input parameters such as gender, age, comorbid conditions, and smoking status.
+Each row in the dataset contains relevant patient information that could be used in a classification model.
+</p>
+
+<h2>Attribute Information</h2>
+<ol>
+  <li><strong>id</strong>: Unique identifier</li>
+  <li><strong>gender</strong>: "Male", "Female", or "Other"</li>
+  <li><strong>age</strong>: Age of the patient</li>
+  <li><strong>hypertension</strong>: 0 = No hypertension, 1 = Has hypertension</li>
+  <li><strong>heart_disease</strong>: 0 = No heart disease, 1 = Has heart disease</li>
+  <li><strong>ever_married</strong>: "No" or "Yes"</li>
+  <li><strong>work_type</strong>: "children", "Govt_job", "Never_worked", "Private", or "Self-employed"</li>
+  <li><strong>Residence_type</strong>: "Rural" or "Urban"</li>
+  <li><strong>avg_glucose_level</strong>: Average blood glucose level</li>
+  <li><strong>bmi</strong>: Body Mass Index</li>
+  <li><strong>smoking_status</strong>: "formerly smoked", "never smoked", "smokes", or "Unknown"</li>
+  <li><strong>stroke</strong>: 0 = Did not have a stroke, 1 = Had a stroke</li>
+</ol>
+
+<p><em>Note:</em> "Unknown" in <code>smoking_status</code> indicates missing or unavailable information for the patient.</p>
+
+<h2>Acknowledgements</h2>
+<p>
+<em>(Confidential Source)</em>. This dataset is intended for <strong>educational purposes only</strong>.<br>
+If you use this dataset in your research or projects, please credit the author.
+</p>
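
<p>
The attribute list above can be read as a small schema. The following is a minimal sketch, not part of the dataset or its tooling, of how one row might be validated and typed before being passed to a classification model. It assumes the data is shipped as a CSV file whose header matches the attribute names exactly. The names <code>CATEGORICAL</code>, <code>BINARY</code>, <code>NUMERIC</code>, and <code>parse_row</code> are hypothetical, and the sample row is made up for illustration.
</p>

```python
import csv
import io

# Allowed values, copied from the attribute list above.
CATEGORICAL = {
    "gender": {"Male", "Female", "Other"},
    "ever_married": {"No", "Yes"},
    "work_type": {"children", "Govt_job", "Never_worked", "Private", "Self-employed"},
    "Residence_type": {"Rural", "Urban"},
    "smoking_status": {"formerly smoked", "never smoked", "smokes", "Unknown"},
}
BINARY = ("hypertension", "heart_disease", "stroke")
NUMERIC = ("age", "avg_glucose_level", "bmi")


def parse_row(row):
    """Validate one CSV row (a dict of strings) and convert it to typed values."""
    record = {"id": row["id"]}
    for name, allowed in CATEGORICAL.items():
        if row[name] not in allowed:
            raise ValueError(f"unexpected {name} value: {row[name]!r}")
        record[name] = row[name]
    for name in BINARY:
        if row[name] not in ("0", "1"):
            raise ValueError(f"{name} must be 0 or 1, got {row[name]!r}")
        record[name] = int(row[name])
    for name in NUMERIC:
        # Assumption: numeric fields are always present and parseable.
        record[name] = float(row[name])
    # Per the note above, "Unknown" smoking status means the value is missing.
    if record["smoking_status"] == "Unknown":
        record["smoking_status"] = None
    return record


# Made-up sample values for illustration only; not taken from the dataset.
SAMPLE_CSV = (
    "id,gender,age,hypertension,heart_disease,ever_married,work_type,"
    "Residence_type,avg_glucose_level,bmi,smoking_status,stroke\n"
    "1,Female,52,0,0,Yes,Private,Urban,98.5,27.1,Unknown,0\n"
)

records = [parse_row(r) for r in csv.DictReader(io.StringIO(SAMPLE_CSV))]
print(records[0])
```

<p>
Mapping "Unknown" to <code>None</code> keeps missing smoking information distinct from the three real categories, so a later step can decide whether to impute it or treat it as its own class.
</p>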