Question: the training data set below. Your goal is to build a classifier to predict whether a patient should or should not be prescribed contact lenses.

the training data set below. Your goal is to build a classifier to predict whether a patient
should or should not be prescribed contact lenses. The classifier will output 1 when it
predicts that contact lenses should be prescribed and 0 when it predicts that they should
not be prescribed. The output class is in the last column "Contact Lenses" and the input
attributes are: "Patient Age", "Astigmatic", "Tear Production", and "Spectacle." More
specifically, you will compare Nave Bayes (NB) and Decision Tree (DT).
For Nave Bayes (NB), you will use m-estimate from the lecture with m=2 and p=0.5
for probability estimations.
For Decision Tree (DT), you will follow the lecture's code to build your trees with-
out pruning except that multiple-way splitting is allowed and use Information Gain (IG)
to select the best attribute. In the case of ties, break ties in favor of the leftmost feature.
(a)(18 points) Compare the performance of NB vs. DT using 2-fold cross-validation
(CV) and report their 2-fold CV accuracy. For the i th fold, the testing
dataset is composed of all the data points whose (ID mod2=i-1).
For each fold, show the induced Nave Bayes (in order of left to right
columns) and DT models.
(b)(6 points) Based on the 2-fold CV accuracy from (a), which classifier, NB or
DT, would you choose? Report your final model for the selected classifier.
the training data set below. Your goal is to

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!