Question: NOTE: For this activity, please use google colab. Upload the said data set to your File explorer. You may also use Jupiter for this activity.
NOTE: For this activity, please use google colab. Upload the said data set to your File explorer. You may also use Jupiter for this activity.
NOTE: FOR THIS EXERCISE - THIS IS A COMBINATION WITH THE PREVIOUS TOPIC THUS, TO USE THE CONVERSION OF CATEGORICAL VARIABLE TO DUMMY VARIABLES FOR THE MARITAL STATUS BEFORE YOU TRAIN YOUR C5.0 MODEL. :) THANK YOU!!
table file. adult_ch6_test Marital status,Income,Cap_Gains_Losses "Married","50K",0.051781 "Never-married","50K",0.000000 "Married",">50K",0.000000 table file adult_ch6_training Marital status,Income,Cap_Gains_Losses "Never-married","50K",0.000000 "Never-married",">50K",0.140841 "Married",">50K",0.051781 "Married",">50K",0.000000
5. Create a cost matrix, called the 3x cost matrix, that specifies a false positive is four times as bad as a false negative. Please use this code for your cost matrix 3x cost_matrix_ 3x={50K':3\} 6. Using the training data set, build a C5.0 model (Model 2) to predict a customer's Income using Marital Status and Capital Gains and Losses, using the 3x cost matrix. 7. Evaluate your predictions from Model 2 using the actual response values from the test data set. Add Overall Model Cost and Profit per Customer to the Model Evaluation Table. Calculate all the measures from the Model Evaluation Table. 8. Compare the evaluation measures from Model 1 and Model 2 using the 3x cost matrix. Discuss the strengths and weaknesses of each model
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
