Card

About Dataset

This dataset provides a detailed and structured overview of oral cancer cases worldwide. It includes key risk factors, symptoms, cancer staging, survival rates, treatment approaches, and economic burden to facilitate research and prediction modeling. The dataset is based on real-world oral cancer statistics, aligning with global health reports and studies.

Key Highlights:

Covers high-incidence regions (India, Pakistan, Sri Lanka, Taiwan) and emerging trends in Western nations.
Includes tobacco, alcohol, HPV infection, betel quid use, and dietary factors as primary risk factors.
Captures economic burden (treatment costs, workdays lost) to assess the financial impact of oral cancer.
Provides cancer staging, survival rates, and early diagnosis indicators for better treatment predictions.
This dataset is valuable for medical professionals, researchers, data scientists, and policymakers aiming to develop early detection models, assess regional disparities, and improve cancer prevention strategies.

Columns Overview

ID – Unique identifier
Country – Country name
Age – Age of the individual
Gender – Male/Female
Tobacco Use – Yes/No
Alcohol Consumption – Yes/No
HPV Infection – Yes/No
Betel Quid Use – Yes/No
Chronic Sun Exposure – Yes/No
Poor Oral Hygiene – Yes/No
Diet (Fruits & Vegetables Intake) – Low/Moderate/High
Family History of Cancer – Yes/No
Compromised Immune System – Yes/No
Oral Lesions – Yes/No
Unexplained Bleeding – Yes/No
Difficulty Swallowing – Yes/No
White or Red Patches in Mouth – Yes/No
Tumor Size (cm) – Numerical value
Cancer Stage – 0 (No Cancer), 1, 2, 3, 4
Treatment Type – Surgery/Radiation/Chemotherapy/Targeted Therapy/No Treatment
Survival Rate (5-Year, %)
Cost of Treatment (USD)
Economic Burden (Lost Workdays per Year)
Early Diagnosis (Yes/No)
Oral Cancer (Diagnosis) – Yes/No (Target Variable)