|
a |
|
b/README.md |
|
|
1 |
<h1 align="center">π <span style="color: #8E44AD;">Breast Cancer Classification: ML Model</span> π</h1> |
|
|
2 |
|
|
|
3 |
<p align="center"> |
|
|
4 |
<b>Predicting Malignant or Benign Tumors with Logistic Regression</b><br> |
|
|
5 |
Early detection saves lives! This model helps in diagnosing breast cancer effectively, enabling better treatment decisions and improved outcomes. |
|
|
6 |
</p> |
|
|
7 |
|
|
|
8 |
--- |
|
|
9 |
|
|
|
10 |
## π <span style="color: #2980B9;">**Project Overview**</span> |
|
|
11 |
|
|
|
12 |
This project focuses on building a **Machine Learning Model** to classify breast tumors as either **malignant** or **benign** using the **Logistic Regression** algorithm. By analyzing tumor features such as **size, shape, and texture**, the model achieves an impressive **93% accuracy**, aiding in early detection and diagnosis. |
|
|
13 |
|
|
|
14 |
### π οΈ <span style="color: #D35400;">**Key Highlights**</span> |
|
|
15 |
- **Data Cleaning & Preprocessing**: Ensured high-quality data for robust predictions. |
|
|
16 |
- **Feature Analysis**: Examined critical tumor features, including size, shape, and texture. |
|
|
17 |
- **Logistic Regression**: Implemented the ML model for binary classification. |
|
|
18 |
- **High Accuracy**: Achieved a **93% accuracy score** for reliable predictions. |
|
|
19 |
- **Impact**: Improves treatment decisions, supports early diagnosis, and enhances patient outcomes. |
|
|
20 |
|
|
|
21 |
--- |
|
|
22 |
### π§ <span style="color: #9B59B6;">Technologies Used</span> |
|
|
23 |
- Language: Python π |
|
|
24 |
- Libraries: |
|
|
25 |
1. pandas and numpy for data manipulation |
|
|
26 |
2. sklearn for model building and evaluation |
|
|
27 |
3. matplotlib and seaborn for visualizations |
|
|
28 |
|
|
|
29 |
|
|
|
30 |
π― <span style="color: #C0392B;">**Project Workflow**</span> |
|
|
31 |
|
|
|
32 |
Data Cleaning & Preprocessing: |
|
|
33 |
1. Removed duplicates and missing values. |
|
|
34 |
2. Normalized features for better model performance. |
|
|
35 |
|
|
|
36 |
Exploratory Data Analysis (EDA): |
|
|
37 |
1. Visualized relationships between key features like tumor size, shape, and texture. |
|
|
38 |
|
|
|
39 |
Model Implementation: |
|
|
40 |
1. Used Logistic Regression for binary classification. |
|
|
41 |
2. Fine-tuned hyperparameters for optimal performance. |
|
|
42 |
Evaluation: |
|
|
43 |
|
|
|
44 |
Achieved 93% accuracy. |
|
|
45 |
Evaluated performance using the confusion matrix, precision, recall, and F1 score. |
|
|
46 |
|
|
|
47 |
--- |
|
|
48 |
**Results:** |
|
|
49 |
|
|
|
50 |
The model reliably classifies tumors as malignant or benign, supporting efficient diagnosis. |
|
|
51 |
### π§ <span style="color: #27AE60;">**Why Logistic Regression?**</span> |
|
|
52 |
- Logistic Regression is a powerful algorithm for binary classification tasks. |
|
|
53 |
- It provides probabilities for each class, making predictions interpretable. |
|
|
54 |
- Suitable for medical datasets with binary outcomes like malignant/benign classifications. |
|
|
55 |
--- |
|
|
56 |
### π <span style="color: #F4D03F;">**Results & Impact**</span> |
|
|
57 |
- Model Accuracy: 93% |
|
|
58 |
# Impact: |
|
|
59 |
- Facilitates early detection of breast cancer. |
|
|
60 |
- Improves treatment decisions and patient outcomes. |
|
|
61 |
- Supports doctors with data-driven insights. |
|
|
62 |
--- |
|
|
63 |
|
|
|
64 |
|
|
|
65 |
<h3 align="center" style="color: #5DADE2;">π If you found this project helpful, donβt forget to star the repo! π</h3> ``` |