Question: 2 0 6 Chapter 3 Classification Table 3 . 6 . Data set for Exercise 4 . table [ [ Instance , a 1

206
Chapter 3 Classification
Table 3.6. Data set for Exercise 4.
\table[[Instance,a1,a2,a3,Target Class],[1,T,T,1.0,+],[2,T,T,6.0,+],[3,T,F,5.0,-],[4,F,F,4.0,+],[5,F,T,7.0,-],[6,F,T,3.0,-],[7,F,F,8.0,-],[8,T,F,7.0,+],[9,F,T,5.0,-]]
(d) Compute the Gini index for the Car Type attribute using multiway split.
(e) Compute the Gini index for the Shirt Size attribute using multiway split.
(f) Which attribute is better, Gender, Car Type, or Shirt Size?
(g) Explain why Customer ID should not be used as the attribute test condition even though it has the lowest Gini.
4. Consider the training examples shown in Table 3.6 for a binary classification problem.
(a) What is the entropy of this collection of training examples with respect to the class attribute?
(b) What are the information gains of a1 and a2 relative to these training examples?
(c) For a3, which is a continuous attribute, compute the information gain for every possible split.
(d) What is the best split (among a1,a2, and a3) according to the information gain?
(e) What is the best split (between a1 and a2) according to the misclassification error rate?
(f) What is the best split (between a1 and a2) according to the Gini index?
2 0 6 Chapter 3 Classification Table 3 . 6 . Data

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!