Question: Here is a dataset: Position Level Performance Previous Promotion Promotion Entry High No Yes Mid - level High Yes Yes Entry High No Yes Entry
Here is a dataset:
Position Level Performance Previous Promotion Promotion
Entry High No Yes
Midlevel High Yes Yes
Entry High No Yes
Entry Low No Yes
Entry Low No No
Midlevel Low Yes No
Midlevel High No Yes
Midlevel Low Yes No
Entry Low Yes Yes
I calc the overall Gini index for this:
Promotion
Proportion No
Proportion Yes
Gini Index
Gini Index
Gini Index
Gini Index
To make the next split, I calc all Gini indices for subregions RR:
Weighted average for Position Level
Weighted average for Performance
Weighted average for Previous Promotion
So then I know the split should be on Performance. The next split, I calc all Gini indices:
Weighted Previous Promotion
Weighted Position
So the next split is Previous Promotion. Which leaves one more split, and Gini calc is:
Weighted Position Level
The Gini index overall after regions splits. Meaning I made an improvement. Is that correct?
Also, how can I draw the decision tree in a treebased format? That is draw the decision tree and each split, and indicate which prediction we would make for each region? Please help me Thanks.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
