--- a +++ b/README.md @@ -0,0 +1,10 @@ +## Smokers-Classification +A repository of implemented ML Algorithms to classification smokers from multivariate data. +As part of the process of learning the research problem and the data, a preliminary file of EDA is also attached. +The task was given as part of a ML course at the university. + +The subtasks given as part of the task: +1) Performing a pre-processing process on the data, handling NaN data and values at the edges, learning the most significant features, examining distributions of variables in the data, performing discretization and categorization. +2) Application of 2 Supervised ML models - NN & Decision Tree, while adjusting the hyper parameters with the help of Grid-Search. Using Naptune.ai for monitoring. +3) Application of K-Means clustering. +4) Application of Un-supervised model with the help of integration and pre-training of NN and then application of Voting Classifier and create ensemble model.