Question: Consider the task of building a decision tree classifier from random data, where the attribute values are generated randomly irrespective of the class labels. Assume

Consider the task of building a decision tree classifier from random data, where the attribute values are generated randomly irrespective of the class labels. Assume the data set contains records from two classes, + and ?. Half of the data set is used for training while the remaining half is used for testing. Answer each question below and justify your answer. (i) Suppose there are an equal number of positive and negative records in the data and the decision tree classifier predicts every test record to be positive. What is the expected error rate of the classifier on the test data? (ii) Repeat the above analysis in (i) assuming that the classifier predicts each test record to be positive with probability 0.8 and to be negative with probability 0.2. (iii) Suppose two-thirds of the data belong to the positive class and the remaining one-third belong to the negative class. What is the expected error of a classifier that predicts every test record to be positive? (iv) Repeat the above analysis in (iii) assuming that the classifier predicts each test record to be positive class with probability 2/3 and negative class with probability 1/3.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!