|
a |
|
b/README.md |
|
|
1 |
<h1>About Dataset</h1> |
|
|
2 |
|
|
|
3 |
<h2>Similar Datasets</h2> |
|
|
4 |
<ul> |
|
|
5 |
<li><strong>🌟 CERN Electron Collision Data:</strong> <a href="#">LINK</a></li> |
|
|
6 |
<li>Hepatitis C Dataset: <a href="#">LINK</a></li> |
|
|
7 |
<li>Body Fat Prediction Dataset: <a href="#">LINK</a></li> |
|
|
8 |
<li>Cirrhosis Prediction Dataset: <a href="#">LINK</a></li> |
|
|
9 |
<li>Heart Failure Prediction Dataset: <a href="#">LINK</a></li> |
|
|
10 |
<li>Stellar Classification Dataset – SDSS17: <a href="#">LINK</a></li> |
|
|
11 |
<li>Wind Speed Prediction Dataset: <a href="#">LINK</a></li> |
|
|
12 |
<li>Spanish Wine Quality Dataset: <a href="#">LINK</a></li> |
|
|
13 |
</ul> |
|
|
14 |
|
|
|
15 |
<h2>Context</h2> |
|
|
16 |
<p> |
|
|
17 |
According to the World Health Organization (WHO), <strong>stroke is the second leading cause of death globally</strong>, accounting for approximately <strong>11% of total deaths</strong>. |
|
|
18 |
</p> |
|
|
19 |
|
|
|
20 |
<p> |
|
|
21 |
This dataset is designed to help predict whether a patient is likely to have a stroke, based on input parameters such as gender, age, comorbid conditions, and smoking status. |
|
|
22 |
Each row in the dataset contains relevant patient information that could be used in a classification model. |
|
|
23 |
</p> |
|
|
24 |
|
|
|
25 |
<h2>Attribute Information</h2> |
|
|
26 |
<ol> |
|
|
27 |
<li><strong>id</strong>: Unique identifier</li> |
|
|
28 |
<li><strong>gender</strong>: "Male", "Female", or "Other"</li> |
|
|
29 |
<li><strong>age</strong>: Age of the patient</li> |
|
|
30 |
<li><strong>hypertension</strong>: 0 = No hypertension, 1 = Has hypertension</li> |
|
|
31 |
<li><strong>heart_disease</strong>: 0 = No heart disease, 1 = Has heart disease</li> |
|
|
32 |
<li><strong>ever_married</strong>: "No" or "Yes"</li> |
|
|
33 |
<li><strong>work_type</strong>: "children", "Govt_job", "Never_worked", "Private", or "Self-employed"</li> |
|
|
34 |
<li><strong>Residence_type</strong>: "Rural" or "Urban"</li> |
|
|
35 |
<li><strong>avg_glucose_level</strong>: Average blood glucose level</li> |
|
|
36 |
<li><strong>bmi</strong>: Body Mass Index</li> |
|
|
37 |
<li><strong>smoking_status</strong>: "formerly smoked", "never smoked", "smokes", or "Unknown"</li> |
|
|
38 |
<li><strong>stroke</strong>: 1 = Had a stroke, 0 = Did not have a stroke</li> |
|
|
39 |
</ol> |
|
|
40 |
|
|
|
41 |
<p><em>Note:</em> "Unknown" in <code>smoking_status</code> indicates missing or unavailable information for the patient.</p> |
|
|
42 |
|
|
|
43 |
<h2>Acknowledgements</h2> |
|
|
44 |
<p> |
|
|
45 |
<em>(Confidential Source)</em> — This dataset is intended for <strong>educational purposes only</strong>.<br> |
|
|
46 |
If you use this dataset in your research or projects, please credit the author. |
|
|
47 |
</p> |