Question: Given the data set below for the binary classification problem. Keep 2 decimal places. ID Attribute A Attribute B Attribute C Class label H H

Given the data set below for the binary
Given the data set below for the binary classification problem. Keep 2 decimal places. ID Attribute A Attribute B Attribute C Class label H H 3.6 Y T F 5.4 N 3 T T 2.1 Y T F 4.2 Y T 5.5 Y T 4.0 N F F 4.8 N 8 F F 2.3 N 9 T F 6.5 N 10 F F 3.7 N (a) Calculate the information gain when splitting on attribute A and B. Which attribute would the decision tree induction algorithm choose? Show all your work. [10] (b) Calculate the reduction in impurity using Gini index when splitting on A and B. Which attribute would the decision tree induction algorithm choose? Show all your work. [10] (c) Compare information gain and Gini index. Explain your results in part (a) and (b). [5] (d) For the attribute C, which is a continuous attribute, describe how you compute the information gain. 5]

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!