Question: Problem 4. Consider a dataset with three columns of binary attributes A1, A, and a binary label attribute Y. There are eight types of data

Problem 4. Consider a dataset with three columns of binary attributes A1, A, and a binary label attribute Y. There are eight types of data point in total, and their corresponding proportions in the dataset are captured in the column P. 0 type A1 A2 Y 1 0 0 0 2 0 0 1 3 0 1 0 4 0 1 5 1 0 0 6 1 0 1 7 1 1 0 8 1 1 1 OOOO P 8% 29% 2% 18% 16% 2% 1% 24% (a) What is the GINI index of the dataset? (b) What is the GINI index of the split on A ? (C) What is the GINI index of the split on Az? (d) Construct a decision tree of exactly five nodes for the dataset using the Hunt's algorithm. (e) What is the accuracy of the decision tree built in (a)
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
