|
a |
|
b/README.md |
|
|
1 |
<div class="sc-cmRAlD dkqmWS"><div class="sc-UEtKG dGqiYy sc-flttKd cguEtd"><div class="sc-fqwslf gsqkEc"><div class="sc-cBQMlg kAHhUk"><h2 class="sc-dcKlJK sc-cVttbi gqEuPW ksnHgj">About Dataset</h2></div></div></div><div class="sc-jgvlka jFuPjz"><div class="sc-gzqKSP ktvwwo"><div style="min-height: 80px;"><div class="sc-etVRix jqYJaa sc-bMmLMY ZURWJ"><h1>CBC Dataset from NHANES</h1> |
|
|
2 |
<p>This dataset contains cleaned and organized data from the <strong>National Health and Nutrition Examination Survey (NHANES)</strong>, focusing on complete blood count (CBC) parameters. The dataset includes various blood test measures such as white blood cell count (WBC), red blood cell count (RBC), hemoglobin (HGB), hematocrit (HCT), mean corpuscular volume (MCV), platelet count (PLT), and others, collected from a large population sample.</p> |
|
|
3 |
<p>The data has been carefully cleaned by:</p> |
|
|
4 |
<ul> |
|
|
5 |
<li>Removing irrelevant columns</li> |
|
|
6 |
<li>Adjusting feature names for consistency</li> |
|
|
7 |
<li>Handling missing values</li> |
|
|
8 |
</ul> |
|
|
9 |
<p>It includes over 100,000 records and aims to provide insights into blood disorders and their correlation with health metrics. This dataset is suitable for researchers and data scientists working in the fields of hematology, diagnostics, and epidemiology.</p> |
|
|
10 |
<h2>Data Fields</h2> |
|
|
11 |
<ul> |
|
|
12 |
<li><strong>WBC</strong>: White blood cell count</li> |
|
|
13 |
<li><strong>LY%</strong>: Lymphocyte percentage</li> |
|
|
14 |
<li><strong>MO%</strong>: Monocyte percentage</li> |
|
|
15 |
<li><strong>NE%</strong>: Neutrophil percentage</li> |
|
|
16 |
<li><strong>EO%</strong>: Eosinophil percentage</li> |
|
|
17 |
<li><strong>BA%</strong>: Basophil percentage</li> |
|
|
18 |
<li><strong>LY#</strong>: Lymphocyte count</li> |
|
|
19 |
<li><strong>MO#</strong>: Monocyte count</li> |
|
|
20 |
<li><strong>NE#</strong>: Neutrophil count</li> |
|
|
21 |
<li><strong>EO#</strong>: Eosinophil count</li> |
|
|
22 |
<li><strong>BA#</strong>: Basophil count</li> |
|
|
23 |
<li><strong>RBC</strong>: Red blood cell count</li> |
|
|
24 |
<li><strong>HGB</strong>: Hemoglobin level</li> |
|
|
25 |
<li><strong>HCT</strong>: Hematocrit level</li> |
|
|
26 |
<li><strong>MCV</strong>: Mean corpuscular volume</li> |
|
|
27 |
<li><strong>MCHC</strong>: Mean corpuscular hemoglobin concentration</li> |
|
|
28 |
<li><strong>MCH</strong>: Mean corpuscular hemoglobin</li> |
|
|
29 |
<li><strong>RDW</strong>: Red cell distribution width</li> |
|
|
30 |
<li><strong>PLT</strong>: Platelet count</li> |
|
|
31 |
<li><strong>MPV</strong>: Mean platelet volume</li> |
|
|
32 |
</ul> |
|
|
33 |
<h2>Use Cases</h2> |
|
|
34 |
<p>This dataset is valuable for analyzing various blood conditions, including:</p> |
|
|
35 |
<ul> |
|
|
36 |
<li><strong>Anemia</strong> (microcytic, normocytic, and macrocytic)</li> |
|
|
37 |
<li><strong>Leukemia</strong></li> |
|
|
38 |
<li><strong>Thrombocytopenia</strong></li> |
|
|
39 |
</ul> |
|
|
40 |
<p>It is ideal for use in machine learning models, health analytics, and diagnostic research.</p> |
|
|
41 |
<h2>Licensing</h2> |
|
|
42 |
<p>The data is sourced from <strong>NHANES</strong>, which is publicly available. The dataset is cleaned and modified for easier analysis and use. The dataset is shared under the <strong>CC BY 4.0</strong> license.</p> |
|
|
43 |
<hr> |
|
|
44 |
<p>Feel free to explore, analyze, and apply this dataset to your own research or projects!</p></div></div> |