{
"nbformat": 4,
"nbformat_minor": 0,
"metadata": {
"colab": {
"provenance": [],
"authorship_tag": "ABX9TyPIL+qDWG1DjkojNL3QOdB/",
"include_colab_link": true
},
"kernelspec": {
"name": "python3",
"display_name": "Python 3"
},
"language_info": {
"name": "python"
}
},
"cells": [
{
"cell_type": "markdown",
"source": [
"# **Lung Cancer Prediction**\n",
"Graphical Analysis"
],
"metadata": {
"id": "hnJZ8QpEieeF"
}
},
{
"cell_type": "markdown",
"source": [
"Exploratory data analysis (EDA) uncovers patterns in the data that a lung cancer prediction model can build on. An effective machine learning model of this kind can assess an individual's cancer risk at relatively low cost and provide personalized risk evaluations. The data for this analysis comes from the website of the online lung cancer prediction system, which supplies the information needed to train and validate the model."
],
"metadata": {
"id": "KtQi29oNivt1"
}
},
{
"cell_type": "code",
"source": [
"# Import libraries for data handling and plotting\n",
"import pandas as pd\n",
"import numpy as np\n",
"import matplotlib.pyplot as plt\n",
"import seaborn as sns\n",
"\n",
"# Suppress warnings to keep the notebook output readable\n",
"import warnings\n",
"warnings.filterwarnings(\"ignore\")"
],
"metadata": {
"id": "FT-92ToIjQUA"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "code",
"source": [
"# Load the lung cancer survey data and display it\n",
"df = pd.read_csv('/content/sample_data/survey lung cancer.csv')\n",
"df"
],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/",
"height": 461
},
"id": "0cuDNeMLks5O",
"outputId": "b457d001-6fc7-4d94-8b82-6370eff4a90c"
},
"execution_count": null,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
" GENDER AGE SMOKING YELLOW_FINGERS ANXIETY PEER_PRESSURE CHRONIC DISEASE \\\n",
"0 M 69 No Yes Yes No No \n",
"1 M 74 Yes No No No Yes \n",
"2 F 59 No No No Yes No \n",
"3 M 63 Yes Yes Yes No No \n",
"4 F 63 No Yes No No No \n",
".. ... ... ... ... ... ... ... \n",
"304 F 56 No No No Yes Yes \n",
"305 M 70 Yes No No No No \n",
"306 M 58 Yes No No No No \n",
"307 M 67 Yes No Yes No No \n",
"308 M 62 No No No Yes No \n",
"\n",
" FATIGUE ALLERGY WHEEZING ALCOHOL CONSUMING COUGHING SHORTNESS OF BREATH \\\n",
"0 Yes No Yes Yes Yes Yes \n",
"1 Yes Yes No No No Yes \n",
"2 Yes No Yes No Yes Yes \n",
"3 No No No Yes No No \n",
"4 No No Yes No Yes Yes \n",
".. ... ... ... ... ... ... \n",
"304 Yes No No Yes Yes Yes \n",
"305 Yes Yes Yes Yes Yes Yes \n",
"306 No Yes Yes Yes Yes No \n",
"307 Yes Yes No Yes Yes Yes \n",
"308 Yes Yes Yes Yes No No \n",
"\n",
" SWALLOWING DIFFICULTY CHEST PAIN LUNG_CANCER \n",
"0 Yes Yes YES \n",
"1 Yes Yes YES \n",
"2 No Yes NO \n",
"3 Yes Yes NO \n",
"4 No No NO \n",
".. ... ... ... \n",
"304 Yes No YES \n",
"305 No Yes YES \n",
"306 No Yes YES \n",
"307 No Yes YES \n",
"308 Yes No YES \n",
"\n",
"[309 rows x 16 columns]"
],
"text/html": [
"\n",
"<table>\n",
"<thead><tr><th></th><th>GENDER</th><th>AGE</th><th>SMOKING</th><th>YELLOW_FINGERS</th><th>ANXIETY</th><th>PEER_PRESSURE</th><th>CHRONIC DISEASE</th><th>FATIGUE</th><th>ALLERGY</th><th>WHEEZING</th><th>ALCOHOL CONSUMING</th><th>COUGHING</th><th>SHORTNESS OF BREATH</th><th>SWALLOWING DIFFICULTY</th><th>CHEST PAIN</th><th>LUNG_CANCER</th></tr></thead>\n",
"<tbody>\n",
"<tr><th>0</th><td>M</td><td>69</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>1</th><td>M</td><td>74</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>2</th><td>F</td><td>59</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>Yes</td><td>NO</td></tr>\n",
"<tr><th>3</th><td>M</td><td>63</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>NO</td></tr>\n",
"<tr><th>4</th><td>F</td><td>63</td><td>No</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>NO</td></tr>\n",
"<tr><th>...</th><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td></tr>\n",
"<tr><th>304</th><td>F</td><td>56</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>YES</td></tr>\n",
"<tr><th>305</th><td>M</td><td>70</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>306</th><td>M</td><td>58</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>307</th><td>M</td><td>67</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>308</th><td>M</td><td>62</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>YES</td></tr>\n",
"</tbody>\n",
"</table>\n",
"<p>309 rows × 16 columns</p>\n",
"\n",
"<table>\n",
"<thead><tr><th></th><th>GENDER</th><th>AGE</th><th>SMOKING</th><th>YELLOW_FINGERS</th><th>ANXIETY</th><th>PEER_PRESSURE</th><th>CHRONIC DISEASE</th><th>FATIGUE</th><th>ALLERGY</th><th>WHEEZING</th><th>ALCOHOL CONSUMING</th><th>COUGHING</th><th>SHORTNESS OF BREATH</th><th>SWALLOWING DIFFICULTY</th><th>CHEST PAIN</th><th>LUNG_CANCER</th></tr></thead>\n",
"<tbody>\n",
"<tr><th>0</th><td>M</td><td>69</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>1</th><td>M</td><td>74</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>YES</td></tr>\n",
"<tr><th>2</th><td>F</td><td>59</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>Yes</td><td>NO</td></tr>\n",
"<tr><th>3</th><td>M</td><td>63</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>NO</td></tr>\n",
"<tr><th>4</th><td>F</td><td>63</td><td>No</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>NO</td></tr>\n",
"<tr><th>...</th><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td></tr>\n",
"<tr><th>279</th><td>F</td><td>59</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>No</td><td>YES</td></tr>\n",
"<tr><th>280</th><td>F</td><td>59</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>No</td><td>NO</td></tr>\n",
"<tr><th>281</th><td>M</td><td>55</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>NO</td></tr>\n",
"<tr><th>282</th><td>M</td><td>46</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>No</td><td>Yes</td><td>Yes</td><td>NO</td></tr>\n",
"<tr><th>283</th><td>M</td><td>60</td><td>No</td><td>Yes</td><td>Yes</td><td>No</td><td>No</td><td>Yes</td><td>No</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td><td>YES</td></tr>\n",
"</tbody>\n",
"</table>\n",
"<p>276 rows × 16 columns</p>
\n", "