QUESTION 1 (Classification) [TOTAL MARKS: 25]

This question uses the dataset in Appendix A, which shows a set of results for Covid-19 tests. The attributes captured are Gender, Age, Setting (where they believed the exposure took place), TripsMade (in the last 5 days), and Home (their living arrangements). Using a Decision Tree approach, answer the following questions.

Q1(a) [3 Marks] Write the TDIDT algorithm (pseudocode) and make sure to explain what is happening at each step. Explain why the algorithm is guaranteed to terminate.

Q1(b) [7 Marks] Using a takelast attribute selection strategy, create a decision tree for this dataset.

Q1(c) [5 Marks] Using an Information Gain approach, calculate the expected information needed to classify a tuple in the full dataset D.

Q1(d) [10 Marks] Again using Information Gain, for the first attribute selection, rank each attribute in the order they would be selected. As part of your answer, which would be the first attribute selected? Why?
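
The quantities the question asks about can be sketched in Python. This is a minimal illustration, not a model answer: Appendix A is not reproduced here, so the rows below are hypothetical toy data, and the function names (`entropy`, `partition`, `information_gain`, `rank_by_gain`, `tdidt_takelast`) and the class column name `Result` are assumptions made for the example. The majority-class fallback used when the attributes run out is also an assumption; textbook TDIDT usually presumes that no two tuples with identical attribute values have different classes.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Expected information, in bits, needed to classify a tuple."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def partition(rows, attribute):
    """Group rows by the value they take for `attribute`."""
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row)
    return groups


def information_gain(rows, attribute, target):
    """Info(D) minus the size-weighted entropy of the subsets split on `attribute`."""
    info_d = entropy([row[target] for row in rows])
    info_a = sum(
        len(subset) / len(rows) * entropy([row[target] for row in subset])
        for subset in partition(rows, attribute).values()
    )
    return info_d - info_a


def rank_by_gain(rows, attributes, target):
    """Attributes sorted from highest to lowest information gain."""
    return sorted(
        attributes,
        key=lambda a: information_gain(rows, a, target),
        reverse=True,
    )


def tdidt_takelast(rows, attributes, target):
    """TDIDT that always splits on the last attribute still available."""
    labels = [row[target] for row in rows]
    if len(set(labels)) == 1:
        # Every tuple has the same class, so this branch becomes a leaf.
        return labels[0]
    if not attributes:
        # Assumption: fall back to the majority class if attributes run out.
        return Counter(labels).most_common(1)[0][0]
    attribute, remaining = attributes[-1], attributes[:-1]
    return {
        attribute: {
            value: tdidt_takelast(subset, remaining, target)
            for value, subset in partition(rows, attribute).items()
        }
    }


# Hypothetical toy rows for illustration only; NOT the Appendix A dataset.
toy = [
    {"Gender": "F", "Home": "Shared", "Result": "Positive"},
    {"Gender": "M", "Home": "Shared", "Result": "Positive"},
    {"Gender": "F", "Home": "Alone", "Result": "Negative"},
    {"Gender": "M", "Home": "Alone", "Result": "Negative"},
]

info_d = entropy([row["Result"] for row in toy])
ranking = rank_by_gain(toy, ["Gender", "Home"], "Result")
tree = tdidt_takelast(toy, ["Gender", "Home"], "Result")
```

In this sketch each recursive call removes one attribute from the list, so the recursion depth cannot exceed the number of attributes, which is the core of the termination argument asked for in Q1(a). Applying the same functions to the real Appendix A rows would give the values for Q1(b) to Q1(d).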
