Question 1:
Temperature   Humidity   PlayTennis
Hot           High       No
Hot           Normal     Yes
Mild          High       No
Mild          Normal     Yes
Cold          High       Yes
Cold          Normal     Yes
For this problem, you can write your answers using but it may be helpful to note that ~~
i. What is the initial entropy of the training sample?
ii. What is the information gain for PlayTennis given Temperature?
iii. What are the entropy and information gain for PlayTennis given Humidity?
iv. Draw the full decision tree that would be learned for this dataset. You do not need to show any calculations.
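The quantities asked for in parts i to iii can be checked numerically. The following is a minimal sketch, assuming the standard definitions (Shannon entropy in bits, and information gain as the label entropy minus the weighted entropy after splitting on one feature). The names ROWS, FEATURES, entropy, conditional_entropy and information_gain are illustrative and do not come from the question.

```python
from collections import Counter
from math import log2

# Training sample from the table above: (Temperature, Humidity, PlayTennis).
ROWS = [
    ("Hot", "High", "No"),
    ("Hot", "Normal", "Yes"),
    ("Mild", "High", "No"),
    ("Mild", "Normal", "Yes"),
    ("Cold", "High", "Yes"),
    ("Cold", "Normal", "Yes"),
]
FEATURES = {"Temperature": 0, "Humidity": 1}  # column index of each feature
LABEL = 2  # column index of PlayTennis


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def conditional_entropy(rows, col):
    """Weighted average label entropy after grouping rows by column `col`."""
    groups = {}
    for row in rows:
        groups.setdefault(row[col], []).append(row[LABEL])
    n = len(rows)
    return sum(len(g) / n * entropy(g) for g in groups.values())


def information_gain(rows, col):
    """Reduction in label entropy obtained by splitting on column `col`."""
    return entropy([row[LABEL] for row in rows]) - conditional_entropy(rows, col)


if __name__ == "__main__":
    print(f"H(PlayTennis) = {entropy([r[LABEL] for r in ROWS]):.3f}")
    for name, col in FEATURES.items():
        print(f"H(PlayTennis | {name}) = {conditional_entropy(ROWS, col):.3f}, "
              f"gain = {information_gain(ROWS, col):.3f}")
```

Under these definitions the sketch gives an initial entropy of about 0.918 bits, a gain of about 0.252 for Temperature and about 0.459 for Humidity, so a greedy learner of this kind would split on Humidity first.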
Question 2:
A dataset has been collected to develop a classifier. The following information is required to understand its features. Complete this exercise for the dataset that you have selected for your own project.
Number of classes
Number of feature variables (how many of them are categorical and how many are continuous)
Number of samples
Number of samples for each class
How imbalanced the dataset is
Average value of each feature variable and its standard deviation for each class
Normalise the data according to the maximum and to the difference between the maximum and minimum values for each feature, and discuss how this may affect the classifier design.
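The summary statistics and normalisation steps listed above could be sketched as follows. This is only an illustration under stated assumptions: SAMPLES is a hypothetical toy dataset (not real data), the imbalance measure is taken to be the ratio of the largest to the smallest class count, and the normalisation item is read as two alternatives, dividing by the maximum and min-max scaling. All function names are made up for this example.

```python
from collections import Counter, defaultdict
from statistics import mean, stdev

# Hypothetical toy data, for illustration only: (feature_1, feature_2, class label).
SAMPLES = [
    (1.0, 10.0, "a"),
    (2.0, 20.0, "a"),
    (3.0, 30.0, "a"),
    (4.0, 40.0, "b"),
    (6.0, 60.0, "b"),
]


def class_counts(samples):
    """Number of samples for each class (the label is the last field)."""
    return Counter(s[-1] for s in samples)


def imbalance_ratio(samples):
    """Largest class count divided by smallest; 1.0 means perfectly balanced."""
    counts = class_counts(samples)
    return max(counts.values()) / min(counts.values())


def per_class_stats(samples):
    """Map each class to a list of (mean, sample std) pairs, one per feature.

    Assumes every class has at least two samples, as stdev requires.
    """
    by_class = defaultdict(list)
    for *features, label in samples:
        by_class[label].append(features)
    return {
        label: [(mean(col), stdev(col)) for col in zip(*rows)]
        for label, rows in by_class.items()
    }


def scale_by_max(column):
    """Divide every value by the column maximum (assumes a positive maximum)."""
    m = max(column)
    return [x / m for x in column]


def min_max_scale(column):
    """Map values to [0, 1] via (x - min) / (max - min); assumes max != min."""
    lo, hi = min(column), max(column)
    return [(x - lo) / (hi - lo) for x in column]


if __name__ == "__main__":
    print(class_counts(SAMPLES), imbalance_ratio(SAMPLES))
    print(per_class_stats(SAMPLES))
    first_feature = [s[0] for s in SAMPLES]
    print(scale_by_max(first_feature))
    print(min_max_scale(first_feature))
```

Either scaling puts features with very different ranges on a comparable scale, which tends to matter for distance-based or gradient-based classifiers and much less for tree-based ones.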
