Question: Question 2 ) Consider the training examples shown in Table 4.7 for a binary classification problem. a)Compute the Gini index for the overall collection of
Question 2 ) Consider the training examples shown in Table 4.7 for a binary classification
problem.
a)Compute the Gini index for the overall collection of training examples.
b)Compute the Gini index for the Customer ID attribute.
c)Compute the Gini index for the Gender attribute.
d)Compute the Gini index for the Car Type attribute using multiway
e)Compute the Gini index for the Shirt Size attribute using multiway
split.
f)Which attribute is better, Gender, Car Type, or Shirt Size?
g)Explain why Customer ID should not be used as the attribute test
condition even though it has the lowest Gini.
Table 4.7. Data set for Exercise 2. Customer IDT ender Car Type Shirt Size Class M Family Small CO Sports Medium CO M Sports Medium CO M Sports Large CO Sports Extra Large CO M Sports Extra Large CO F Sports Small CO Small Sports CO Sports Medium CO 10 F Luxury Large CO 11 M Family Large C1 12 M Family Extra Large C1 13 M Family Medium C1 14 Luxury Extra Large C1 Luxury Small C1 16 Luxury Small C1 17 Luxury Medium C1 18 Luxury Medium C1 19 Luxury Medium C1 20 Large C1 Luxury
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
