About Dataset

CBC Dataset from NHANES

This dataset contains cleaned and organized complete blood count (CBC) data from the National Health and Nutrition Examination Survey (NHANES). It covers measures such as white blood cell count (WBC), red blood cell count (RBC), hemoglobin (HGB), hematocrit (HCT), mean corpuscular volume (MCV), and platelet count (PLT), collected from a large population sample.

The data has been carefully cleaned by:

  • Removing irrelevant columns
  • Adjusting feature names for consistency
  • Handling missing values
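The README does not say exactly how each cleaning step was carried out, so the following is only a minimal sketch of one plausible approach using the Python standard library. The raw column names (`raw_wbc`, `raw_hgb`, `raw_plt`), the `clean_rows` helper, and the policy of dropping rows with missing values are all assumptions made for illustration, not the method actually used to build this dataset.

```python
import csv
import io

# Hypothetical raw-to-clean column mapping. The real source column names
# are not given in this README, so these are placeholders.
RENAME = {"raw_wbc": "WBC", "raw_hgb": "HGB", "raw_plt": "PLT"}


def clean_rows(raw_csv_text):
    """Keep only mapped columns, rename them, and drop incomplete rows."""
    reader = csv.DictReader(io.StringIO(raw_csv_text))
    cleaned = []
    for row in reader:
        # Remove irrelevant columns and apply the consistent feature names.
        kept = {new: (row.get(old) or "").strip() for old, new in RENAME.items()}
        # Assumed missing-value policy: discard any row with an empty field.
        if any(value == "" for value in kept.values()):
            continue
        cleaned.append({name: float(value) for name, value in kept.items()})
    return cleaned


# Two sample rows: the second has a missing WBC value and is dropped.
raw = "id,raw_wbc,raw_hgb,raw_plt\n1,6.2,14.1,250\n2,,13.0,310\n"
print(clean_rows(raw))
```

Other missing-value strategies, such as imputation, would fit the same structure by replacing the `continue` branch.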

It includes over 100,000 records and is intended to support analysis of blood disorders and their correlation with health metrics. This dataset is suitable for researchers and data scientists working in the fields of hematology, diagnostics, and epidemiology.

Data Fields

  • WBC: White blood cell count
  • LY%: Lymphocyte percentage
  • MO%: Monocyte percentage
  • NE%: Neutrophil percentage
  • EO%: Eosinophil percentage
  • BA%: Basophil percentage
  • LY#: Lymphocyte count
  • MO#: Monocyte count
  • NE#: Neutrophil count
  • EO#: Eosinophil count
  • BA#: Basophil count
  • RBC: Red blood cell count
  • HGB: Hemoglobin level
  • HCT: Hematocrit level
  • MCV: Mean corpuscular volume
  • MCHC: Mean corpuscular hemoglobin concentration
  • MCH: Mean corpuscular hemoglobin
  • RDW: Red cell distribution width
  • PLT: Platelet count
  • MPV: Mean platelet volume
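Several of the fields above are related by standard hematology formulas, which can be useful as a sanity check on loaded rows. The sketch below assumes conventional units (RBC in 10^6 cells/µL, HGB in g/dL, HCT in percent, WBC in 10^3 cells/µL); this README does not state the units, so verify them before relying on the results. The function names are hypothetical.

```python
def red_cell_indices(rbc, hgb, hct):
    """Derive MCV (fL), MCH (pg), and MCHC (g/dL) from RBC, HGB, and HCT.

    Assumed units: RBC in 10^6 cells/uL, HGB in g/dL, HCT in percent.
    """
    mcv = hct / rbc * 10    # mean corpuscular volume
    mch = hgb / rbc * 10    # mean corpuscular hemoglobin
    mchc = hgb / hct * 100  # mean corpuscular hemoglobin concentration
    return {"MCV": mcv, "MCH": mch, "MCHC": mchc}


def absolute_count(wbc, percent):
    """Absolute differential count (e.g. LY#) from WBC and its percentage (e.g. LY%)."""
    return wbc * percent / 100


indices = red_cell_indices(rbc=5.0, hgb=15.0, hct=45.0)
print(indices)
print(absolute_count(wbc=6.0, percent=30.0))
```

Comparing these derived values against the stored MCV, MCH, MCHC, and `#` columns, within a small tolerance, is one way to spot unit mismatches or data-entry problems.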

Use Cases

This dataset is valuable for analyzing various blood conditions, including:

  • Anemia (microcytic, normocytic, and macrocytic)
  • Leukemia
  • Thrombocytopenia

It is ideal for use in machine learning models, health analytics, and diagnostic research.
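As a starting point for the use cases above, the sketch below labels a record by red cell size and flags a low platelet count. The cut-offs (MCV of 80 and 100 fL, PLT of 150 × 10^3/µL) are commonly cited adult reference values and are assumptions here, as are the units and the function names; they are not part of this dataset and are not diagnostic criteria. Note that an MCV category alone does not indicate anemia, which also depends on hemoglobin thresholds that vary with sex and age, fields not listed in this README.

```python
def mcv_category(mcv_fl, low=80.0, high=100.0):
    """Label red cell size from MCV in fL using assumed reference cut-offs."""
    if mcv_fl < low:
        return "microcytic"
    if mcv_fl > high:
        return "macrocytic"
    return "normocytic"


def is_thrombocytopenic(plt, threshold=150.0):
    """Flag a platelet count (assumed 10^3 cells/uL) below an assumed threshold."""
    return plt < threshold


# Illustrative values only, not taken from the dataset.
for mcv in (72.0, 90.0, 108.0):
    print(mcv, mcv_category(mcv))
print(is_thrombocytopenic(120.0))
```

Labels produced this way could serve as baseline features or weak labels when building machine learning models on the data.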

Licensing

The data is sourced from NHANES, which is publicly available, and has been cleaned and modified for easier analysis and use. The dataset is shared under the CC BY 4.0 license.


Feel free to explore, analyze, and apply this dataset to your own research or projects!