About Dataset

Introduction

Purpose: This synthetic dataset models factors associated with liver health.
Significance: It aids in predictive modeling and healthcare strategies.

Variables

  • Age: Range: 20 to 80 years.
  • Gender: Male (0) or Female (1).
  • BMI (Body Mass Index): Range: 15 to 40.
  • Alcohol Consumption: Range: 0 to 20 units per week.
  • Smoking: No (0) or Yes (1).
  • Genetic Risk: Low (0), Medium (1), High (2).
  • Physical Activity: Range: 0 to 10 hours per week.
  • Diabetes: No (0) or Yes (1).
  • Hypertension: No (0) or Yes (1).
  • Liver Function Test: Range: 20 to 100.
  • Diagnosis: Binary indicator (0 or 1) of liver disease presence.
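The ranges and codes listed above can be turned into a simple record check. The following is a minimal sketch, not part of the dataset itself: the dictionary keys are hypothetical column names (the card does not state the file's actual headers), and the bounds are assumed to be inclusive.

```python
# Ranges and codes copied from the Variables list.
# ASSUMPTION: the keys below are hypothetical column names; rename them to
# match the actual file headers. Bounds are treated as inclusive.
RANGES = {
    "Age": (20, 80),
    "BMI": (15, 40),
    "AlcoholConsumption": (0, 20),
    "PhysicalActivity": (0, 10),
    "LiverFunctionTest": (20, 100),
}
CODES = {
    "Gender": {0, 1},
    "Smoking": {0, 1},
    "GeneticRisk": {0, 1, 2},
    "Diabetes": {0, 1},
    "Hypertension": {0, 1},
    "Diagnosis": {0, 1},
}


def validate_record(record):
    """Return a list of problems; an empty list means the record fits the card."""
    problems = []
    # Continuous variables: check against the documented range.
    for name, (low, high) in RANGES.items():
        value = record.get(name)
        if value is None:
            problems.append(f"{name}: missing")
        elif not (low <= value <= high):
            problems.append(f"{name}: {value} outside [{low}, {high}]")
    # Coded variables: check against the documented set of codes.
    for name, allowed in CODES.items():
        value = record.get(name)
        if value is None:
            problems.append(f"{name}: missing")
        elif value not in allowed:
            problems.append(f"{name}: {value} not in {sorted(allowed)}")
    return problems
```

Calling `validate_record(row)` on each row and collecting the non-empty results gives a quick list of rows that disagree with the documented ranges.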

Dataset Composition

  • Size: 1500 records.
  • Composition: Features encompassing demographic, lifestyle, and health indicators.
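A quick sanity check after download is to confirm the record count stated above. This sketch assumes the data is distributed as a CSV file with a header row and that every column is numeric, as the Variables list suggests; the file path passed to `load_records` is whatever name the downloaded file has, which the card does not specify.

```python
import csv

EXPECTED_ROWS = 1500  # from the "Size" entry on this card


def load_records(path):
    """Read a CSV file into a list of dicts, converting every field to float.

    ASSUMPTION: the file has a header row and only numeric columns.
    """
    with open(path, newline="") as handle:
        return [
            {key: float(value) for key, value in row.items()}
            for row in csv.DictReader(handle)
        ]


def has_expected_size(records, expected=EXPECTED_ROWS):
    """True when the number of loaded records matches the documented size."""
    return len(records) == expected
```

If `has_expected_size` returns False, the file may be truncated or may be a different version from the one described here.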

Applications

  • Research Insights: Explore correlations and predictive models for liver disease.
  • Healthcare Interventions: Inform preventive measures and personalized healthcare.
  • Data Science: Suitable for binary classification exercises that predict Diagnosis from the other variables.
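As a starting point for the correlation analysis mentioned above, the Pearson coefficient between a single variable and the Diagnosis label can be computed with the standard library alone. This is an illustrative sketch: the `records` argument is assumed to be a list of dicts with numeric values, and the column names passed in are hypothetical until checked against the actual file headers.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def correlation_with_diagnosis(records, feature, target="Diagnosis"):
    """Correlate one feature column with the binary target column.

    ASSUMPTION: `feature` and `target` are keys present in every record.
    """
    xs = [record[feature] for record in records]
    ys = [record[target] for record in records]
    return pearson(xs, ys)
```

With a 0/1 target, this value is the point-biserial correlation. It only captures linear association, so a low value does not rule out a nonlinear relationship.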

Limitations

  • External Factors: Other factors that affect liver health are not included in the dataset.

Disclaimer

This dataset has been preprocessed and cleaned to ensure that users can focus on the most critical aspects of their analysis. The preprocessing steps were designed to eliminate noise and irrelevant information, allowing you to concentrate on developing and fine-tuning your predictive models.

Conclusion

  • Summary: Comprehensive dataset offering insights into liver disease risk factors.

Dataset Usage and Attribution Notice

This dataset, shared by Rabie El Kharoua, is original and has never been shared before. It is made available under the CC BY 4.0 license, allowing anyone to use the dataset in any form as long as proper citation is given to the author. A DOI is provided for proper referencing. Please note that duplication of this work within Kaggle is not permitted.

Exclusive Synthetic Dataset

This dataset is synthetic and was generated for educational purposes, making it ideal for data science and machine learning projects. It is an original dataset, owned by Mr. Rabie El Kharoua, and has not been previously shared. You are free to use it under the license outlined on the data card. The dataset is offered without any guarantees. Details about the data provider will be shared soon.