Question: A-4. [10 marks: 2.5 each]: Using the same dataset split in A-3.a:
ISE: Homework
a. Build a Random Forest classifier for predicting the class label with trees. Fit the classifier
using the training set. Set criterion to entropy and random_state to
b. Draw the trees using scikit-learn (sklearn).
c. Test the classifier on the testing data set, and print the confusion matrix and classification
metrics (Accuracy, Sensitivity (Recall), Precision) of the Random Forest classifier.
d. Repeat A-4.a-c using a Random Forest with trees instead of
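Parts a to c can be sketched as follows. This is only an illustrative outline, not the expected solution: the homework's dataset and its A-3.a split are not shown in the question, so synthetic data from make_classification stands in for them, and the values N_TREES = 10 and RANDOM_STATE = 0 are assumptions because the actual numbers are missing from the question text. The helper draw_trees is a hypothetical name.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score)
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption): replace with the split from A-3.a.
X, y = make_classification(n_samples=200, n_features=5, n_informative=3,
                           n_redundant=0, n_classes=3,
                           n_clusters_per_class=1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

N_TREES = 10       # assumption: the tree count is missing from the question
RANDOM_STATE = 0   # assumption: the random_state value is missing as well

# Part a: fit a random forest that uses entropy as the split criterion.
forest = RandomForestClassifier(n_estimators=N_TREES, criterion="entropy",
                                random_state=RANDOM_STATE)
forest.fit(X_train, y_train)

# Part c: evaluate on the held-out test set.
y_pred = forest.predict(X_test)
cm = confusion_matrix(y_test, y_pred)
accuracy = accuracy_score(y_test, y_pred)
print(cm)
print("Accuracy:", accuracy)
# Macro averaging is one possible choice for a multi-class label.
print("Recall (macro):", recall_score(y_test, y_pred, average="macro"))
print("Precision (macro):",
      precision_score(y_test, y_pred, average="macro", zero_division=0))


def draw_trees(model, max_trees=3):
    """Part b: plot the first few trees of a fitted forest (needs matplotlib)."""
    import matplotlib.pyplot as plt
    from sklearn.tree import plot_tree

    for i, tree in enumerate(model.estimators_[:max_trees]):
        fig, ax = plt.subplots(figsize=(12, 6))
        plot_tree(tree, filled=True, ax=ax)
        ax.set_title(f"Tree {i}")
    plt.show()
```

For part d, the same code could be rerun with a different value of N_TREES and the two confusion matrices compared.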
A marks: Calculate the Information Gain (IG) for the class variable Drug, given that the feature
BP is selected as the root node.
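Information gain is the entropy of the class variable minus the weighted average entropy of the class within each value of the splitting feature: IG(Drug, BP) = H(Drug) - sum over v of (|S_v| / |S|) * H(Drug | BP = v). A minimal sketch of that calculation is given below. The six toy records and the label names drugA and drugX are made up for illustration and are not taken from the homework dataset.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def information_gain(feature, labels):
    """Entropy of labels minus the weighted entropy after splitting on feature."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Toy data (assumption): not the homework dataset.
bp = ["HIGH", "HIGH", "LOW", "LOW", "NORMAL", "NORMAL"]
drug = ["drugA", "drugA", "drugX", "drugX", "drugA", "drugX"]

ig = information_gain(bp, drug)
print("IG(Drug, BP) =", round(ig, 3))
```

On the real data, the two lists would be the BP and Drug columns of the training set.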
A marks: From the decision tree built in A, write three classification rules using the
normalized values first, then convert them back to the original values.
A marks: Write an association rule for "BP → Cholesterol". Which rule has the highest
accuracy? Write the corresponding support and accuracy.
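One way to approach this is to enumerate every rule of the form "BP = a → Cholesterol = c" and compute its support and confidence. The sketch below assumes that "accuracy" in the question means rule confidence, that is, P(Cholesterol = c | BP = a); that reading is an assumption. The toy records and the function name rule_stats are hypothetical.

```python
from collections import Counter


def rule_stats(antecedent, consequent):
    """Support and confidence for every observed rule a -> c."""
    n = len(antecedent)
    pair_counts = Counter(zip(antecedent, consequent))
    ante_counts = Counter(antecedent)
    return {
        (a, c): {
            "support": k / n,                # P(a and c)
            "confidence": k / ante_counts[a]  # P(c | a)
        }
        for (a, c), k in pair_counts.items()
    }


# Toy data (assumption): not the homework dataset.
bp = ["HIGH", "HIGH", "HIGH", "LOW", "LOW", "NORMAL"]
chol = ["HIGH", "HIGH", "NORMAL", "NORMAL", "NORMAL", "HIGH"]

stats = rule_stats(bp, chol)
# Pick the rule with the highest confidence, breaking ties by support.
best_rule, best = max(
    stats.items(), key=lambda kv: (kv[1]["confidence"], kv[1]["support"]))
print(f"BP={best_rule[0]} -> Cholesterol={best_rule[1]}", best)
```

Breaking ties by support is a design choice here; a rule with confidence 1.0 that covers a single record is usually less convincing than one that covers many.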
A marks: Repeat parts b, c, and d in A using the Naïve Bayes (GaussianNB) classifier.
A Compare the performance of the Naïve Bayes classifier against the built decision tree and random
forest classifiers using the confusion matrix. Based on the comparison, which one is the best to use
with the given dataset?
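The comparison could be organized as in the sketch below, which fits the three classifiers on the same split and prints each confusion matrix next to its accuracy. It again uses synthetic stand-in data, and the hyperparameters (10 trees, random_state 0) are assumptions, so the printed numbers say nothing about which model is best on the actual homework dataset.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): replace with the split from A-3.a.
X, y = make_classification(n_samples=200, n_features=5, n_informative=3,
                           n_redundant=0, n_classes=3,
                           n_clusters_per_class=1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

# Hyperparameter values are assumptions, not taken from the question.
models = {
    "decision_tree": DecisionTreeClassifier(criterion="entropy",
                                            random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=10,
                                            criterion="entropy",
                                            random_state=0),
    "naive_bayes": GaussianNB(),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    results[name] = {
        # Fixing the label order keeps the matrices directly comparable.
        "confusion_matrix": confusion_matrix(y_test, y_pred, labels=[0, 1, 2]),
        "accuracy": accuracy_score(y_test, y_pred),
    }
    print(name, "accuracy:", results[name]["accuracy"])
    print(results[name]["confusion_matrix"])
```

Reading the off-diagonal cells of each matrix shows which classes a model confuses, which a single accuracy figure hides.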
