Question: Problem 3 ( 4 0 points ) Assume we get some data from a car insurance company in Table 1 , where there are 6
Problem points
Assume we get some data from a car insurance company in Table where there are data instances representing people, with attributes Age and Car and class label Risk Here Age is a continuous attribute. Now we will build decision trees for this data set.
Table : Data for Problem Age is numeric and Car is categorical. Risk gives the class label for each point: high H or low L
Let us consider a multiway split for the Car attribute using its unique values for partition What is the information gain if we choose the Car attribute to split the root node? points
Let us consider the binary splits for the Car attribute. Using information gain as the measure, which binary split of the Car attribute is the best at the root node? points
Between and which one do you prefer for splitting the root node using the Car attribute? Hint: Consider the GainRatio measure. points
Now, construct an entire decision tree for the given data set, using either information gain or GainRatio as the split point evaluation measure. You will need to consider the Age attribute for splitting the root node as well. Only consider binary splits of the Age attribute. You can leverage your calculations or conclusions in points
Classify the point Age CarSUV based on the constructed decision tree in points
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
