## Smokers-Classification
A repository of ML algorithms implemented to classify smokers from multivariate data.
A preliminary EDA file is also included, produced while studying the research problem and the data.
The task was assigned as part of an ML course at the university.

The subtasks were:
1) Pre-processing the data: handling NaN values and extreme values (outliers), identifying the most significant features, examining the distributions of the variables, and performing discretization and categorization.
2) Applying two supervised ML models, a neural network (NN) and a Decision Tree, with hyperparameters tuned via Grid-Search. Neptune.ai was used for monitoring.
3) Applying K-Means clustering.
4) Applying an unsupervised model by integrating and pre-training an NN, then applying a Voting Classifier to create an ensemble model.
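
As a rough illustration of how the supervised subtasks could fit together, here is a minimal sketch using scikit-learn. It is not the code from this repository. The data is synthetic (generated with `make_classification`), the hyperparameter grids are arbitrary, and the median imputation and scaling steps are assumed choices for the pre-processing stage. Neptune.ai logging, K-Means, and the pre-training step are left out.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.impute import SimpleImputer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the smokers data (assumption: the real dataset is not used here).
X, y = make_classification(
    n_samples=400, n_features=8, n_informative=5, random_state=0
)
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.05] = np.nan  # inject missing values to mimic NaN handling

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)


def preprocess():
    """Assumed pre-processing steps: median imputation followed by scaling."""
    return [SimpleImputer(strategy="median"), StandardScaler()]


# Grid search over a small, arbitrary grid for each supervised model.
tree_search = GridSearchCV(
    make_pipeline(*preprocess(), DecisionTreeClassifier(random_state=0)),
    param_grid={"decisiontreeclassifier__max_depth": [3, 5, None]},
    cv=3,
)
nn_search = GridSearchCV(
    make_pipeline(*preprocess(), MLPClassifier(max_iter=500, random_state=0)),
    param_grid={"mlpclassifier__hidden_layer_sizes": [(16,), (32,)]},
    cv=3,
)
tree_search.fit(X_train, y_train)
nn_search.fit(X_train, y_train)

# Combine the tuned models; soft voting averages their predicted probabilities.
ensemble = VotingClassifier(
    estimators=[
        ("tree", tree_search.best_estimator_),
        ("nn", nn_search.best_estimator_),
    ],
    voting="soft",
)
ensemble.fit(X_train, y_train)
accuracy = ensemble.score(X_test, y_test)
print(f"ensemble accuracy: {accuracy:.3f}")
```

Putting the imputer and scaler inside each pipeline means they are refit on every cross-validation fold, so the grid search does not see statistics from held-out data.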