Question: explain this Gulia and Praveen Rani review the classification algorithms that have been built from the The Indian Liver Patient Dataset (ILPD), including random forests
explain this Gulia and Praveen Rani review the classification algorithms that have been built from the The Indian Liver Patient Dataset (ILPD), including random forests (RFs) and support vector machines (SVMs). A more recent review from BanuPriya and Tamilselvi describe the accuracies of additional models including Bayesian Networks, which is further built on by the work of Aswathy who evaluates the performance of logistic regression (LR) models on the ILPD. We replicate the methods of these studies, reproducing RF, SVM, Gaussian Nave Bayes (GNB) and LR classifiers. We implement these models across four experiments, in which we evaluate the overall and sex-stratified performance of the classifiers. Initially, we reproduce existing studies, building a predictive algorithm on the full unbalanced dataset to predict liver disease. Data were divided into test and training subsets (30%/70%), hyperparameters were tuned using GridSearchCV(), the model was trained on the mixed-sex data and results were stratified by sex to give the evaluation metrics for males/females separately. We do this 100 times (building, training and testing separate models) and report average results with SD over the 100 runs. This is done for all four classifiers resulting in four results tables (online supplemental material B Spreadsheets, 'Experiment 3.1.1RF''Experiment 3.1.1 GNB')
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
