Question: NOTE: For this activity, please use google colab. Upload the said data set to your File explorer. You may also use Jupiter for this activity.

NOTE: For this activity, please use google colab. Upload the said data set to your File explorer. You may also use Jupiter for this activity.

NOTE: FOR THIS EXERCISE - THIS IS A COMBINATION WITH THE PREVIOUS TOPIC THUS, TO USE THE CONVERSION OF CATEGORICAL VARIABLE TO DUMMY VARIABLES FOR THE MARITAL STATUS BEFORE YOU TRAIN YOUR C5.0 MODEL. :) THANK YOU!!

NOTE: For this activity, please use google colab. Upload the said data table file. adult_ch6_test Marital status,Income,Cap_Gains_Losses "Married","50K",0.051781 "Never-married","50K",0.000000 "Married",">50K",0.000000 table file adult_ch6_training Marital status,Income,Cap_Gains_Losses "Never-married","50K",0.000000 "Never-married",">50K",0.140841 "Married",">50K",0.051781 "Married",">50K",0.000000

5. Create a cost matrix, called the 3x cost matrix, that specifies a false positive is four times as bad as a false negative. Please use this code for your cost matrix 3x cost_matrix_ 3x={50K':3\} 6. Using the training data set, build a C5.0 model (Model 2) to predict a customer's Income using Marital Status and Capital Gains and Losses, using the 3x cost matrix. 7. Evaluate your predictions from Model 2 using the actual response values from the test data set. Add Overall Model Cost and Profit per Customer to the Model Evaluation Table. Calculate all the measures from the Model Evaluation Table. 8. Compare the evaluation measures from Model 1 and Model 2 using the 3x cost matrix. Discuss the strengths and weaknesses of each model

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!