Question: 12. Consider a labeled data set containing 100 data instances, which is randomly partitioned into two sets A and B, each containing 50 instances. We

12. Consider a labeled data set containing 100
12. Consider a labeled data set containing 100 data instances, which is randomly partitioned into two sets A and B, each containing 50 instances. We use A as the training set to learn two decision trees, Tig with 10 leaf nodes and Tion with 100 leaf nodes. The accuracies of the two decision trees on data sets A and B are shown in Table 3.7. (a) Based on the accuracies shown in Table 3.7, which classification model would you expect to have better performance on unseen instances? (b) Now, you tested Tio and Ting on the entire data set (A + B) and found that the classification accuracy of Tio on data set (A+ B) is 0.85, whereas the classification accuracy of 7jog on the data set (A + B) is 0.87. Based on this new information and your observations from Table 3.7, which classification model would you finally choose for classification

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!