Dataset Column Descriptions:
id:
Type: Integer
Description: Unique identifier for each row in the dataset.
disease_name:
Type: Categorical
Description: The name of the disease (e.g., "Tuberculosis", "Pneumonia", "COPD", etc.).
type:
Type: Categorical
Description: Indicates whether the disease is Communicable, Non-communicable, or Water-borne.
symptoms:
Type: Categorical (Multi-value Text)
Description: A list of symptoms associated with the disease (e.g., "Cough, fever, difficulty breathing").
causes:
Type: Categorical (Multi-value Text)
Description: The cause of the disease (e.g., "Mycobacterium tuberculosis bacteria", "Smoking, genetic factors").
transmission_mode:
Type: Categorical
Description: The method by which the disease spreads (e.g., "Airborne", "Contaminated water"). May be None for non-communicable diseases.
treatment:
Type: Categorical (Text)
Description: Treatment options for the disease (e.g., "Antibiotics", "Bronchodilators, steroids").
prevention:
Type: Categorical (Text)
Description: Prevention methods to reduce the likelihood of contracting the disease (e.g., "Vaccination, good hygiene", "Avoid smoking").
heart_rate:
Type: Numerical (Float)
Description: The patient's heart rate in beats per minute (BPM), recorded during the infection.
blood_pressure:
Type: Numerical (Text: "Systolic/Diastolic")
Description: The patient's blood pressure measurement, represented as systolic/diastolic (e.g., "120/80").
fever:
Type: Categorical (Binary: 0 or 1)
Description: Whether the patient has a fever (1 = Yes, 0 = No).
body_temperature:
Type: Numerical (Float)
Description: The patient's body temperature in degrees Celsius.
duration_of_infection:
Type: Numerical (Integer)
Description: Duration of the infection in days.
Special Case (No Infection)