Question: The table below lists a sample of data from a census (20 pts). There are four descriptive features and one target feature in this dataset:
AGE, a continuous feature listing the age of the individual
EDUCATION, a categorical feature listing the highest education award achieved by the individual (high school, bachelors, doctorate)
MARITAL STATUS, a categorical feature listing the marital status of the individual (never married, married, divorced)
OCCUPATION, a categorical feature listing the individual's occupation (transport = works in the transportation industry; professional = doctors, lawyers, etc.; agriculture = works in the agricultural industry; armed forces = is a member of the armed forces)
ANNUAL INCOME, the target feature with 3 levels (< 25K, 25K-50K, > 50K)
a. Calculate the entropy for this dataset.
b. Calculate information gain (based on entropy) for the EDUCATION, MARITAL STATUS, and OCCUPATION features.
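Parts a and b can be sketched in Python as follows. This is a minimal illustration, not a worked answer. It assumes entropy is measured in bits (log base 2), and that information gain is the entropy of ANNUAL INCOME minus the weighted entropy left after partitioning on a feature. The function names `entropy` and `information_gain` are hypothetical, and the rows in `toy_rows` are made-up placeholders, not the census sample.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy, in bits, of a list of target labels (assumes log base 2)."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, feature, target):
    """Entropy of the target minus the weighted entropy remaining after splitting on `feature`."""
    total = len(rows)
    # Group the target labels by the level of the descriptive feature.
    partitions = {}
    for row in rows:
        partitions.setdefault(row[feature], []).append(row[target])
    remainder = sum(len(part) / total * entropy(part) for part in partitions.values())
    return entropy([row[target] for row in rows]) - remainder


# Hypothetical placeholder rows, NOT the census sample from the question.
toy_rows = [
    {"EDUCATION": "high school", "MARITAL STATUS": "married", "ANNUAL INCOME": "< 25K"},
    {"EDUCATION": "high school", "MARITAL STATUS": "divorced", "ANNUAL INCOME": "< 25K"},
    {"EDUCATION": "bachelors", "MARITAL STATUS": "married", "ANNUAL INCOME": "> 50K"},
    {"EDUCATION": "bachelors", "MARITAL STATUS": "divorced", "ANNUAL INCOME": "> 50K"},
]

if __name__ == "__main__":
    target = "ANNUAL INCOME"
    print("entropy:", entropy([row[target] for row in toy_rows]))
    for feature in ("EDUCATION", "MARITAL STATUS"):
        print(feature, "gain:", information_gain(toy_rows, feature, target))
```

To apply this to the question, replace `toy_rows` with one dictionary per row of the census table and add OCCUPATION to the list of features. AGE is continuous, so it would need a threshold split before a gain could be computed the same way.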
