Question: Suppose that we will learn a decision tree to predict if students pass a class (Y or N) using the dataset below, based on their

Suppose that we will learn a decision tree to predict if students pass a class (Y or N) using the dataset below, based on their current GPA (H, M, or L) and whether or not they studied (S or NS). Use log2, i.e., the log base is 2, when you calculate and write answers. Show the steps how you obtain the result for all questions below for full credits. GPS Studied Passed L NS N L S N L S Y M NS N M S Y M S Y H S Y H NS Y What is the entropy of the attribute "Passed"? If we were to split using the attribute "GPA", (a) what are the entropies of each of the three children nodes (i.e., GPA=L, GPA=M, and GPA=H) and (b) what is the weighted entropy of the three children nodes from the split? If we were to split using the attribute "Studied", (a) what are the entropies of each of the two children nodes (Studied=S and Studied=NS) and (b) what is the weighted entropy of the two children nodes from the split? (4) Which attribute should be split first? Explain why.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related General Management Questions!