Question: The table below lists a sample of data from a census. 20 pts There are four descriptive features and one target feature in this dataset:

  1. The table below lists a sample of data from a census. 20 pts

There are four descriptive features and one target feature in this dataset:

AGE, a continuous feature listing the age of the individual

EDUCATION, a categorical feature listing the highest education award achieved by the individual (high school, bachelors, doctorate)

MARITAL STATUS (never married, married, divorced)

OCCUPATION (transport = works in the transportation industry; professional = doctors, lawyers, etc.; agriculture = works in the agricultural industry; armed forces = is a member of the armed forces)

ANNUAL INCOME, the target feature with 3 levels (< 25K, 25K- 50K, > 50K)

a. Calculate the entropy for this dataset.

b. Calculate information gain (based on entropy) for the EDUCATION, MARITAL STATUS, and OCCUPATION features.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!