Question: Given the scenario below; it is a small sample dataset for the student engagement in class routines and a prediction to whether such student will
Given the scenario below; it is a small sample dataset for the student engagement in class routines and a prediction to whether such student will pass or fail a particular course. You're required to apply the decision tree using Entropy and Gain Ratio to determine the factors that most significantly influence the outcome.
Hints or simply and if it happens you encounter set it to
tableStudy Hours,Attendance,tablePreviousGradesParticipation,ResultLowPoor,Low,NoFailMediumFair,Medium,Yes,PassHighGood,High,Yes,PassLowGood,Low,NoFailMediumPoor,Medium,NoFailHighFair,High,Yes,PassLowFair,Low,Yes,FailMediumGood,Medium,NoPassHighPoor,High,Yes,PassLowFair,Medium,Yes,Fail
Repeat question a using Gini index and gain ratio. Compare the final decision tree you obtain in both a and here. Is there any difference? If there is or not, explain the reason.
Hints: the steps for computing gain ratio are the same in a and b and the splitting information formula remains the same in both cases ie it uses Logarithm. Only the gini index computation differs.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
