Question: Develop a decision tree to classify the Ionosphere data set by using MATLAB

2. Develop a decision tree to classify the Ionosphere data set by using MATLAB. This dataset ships with MATLAB (>> load ionosphere). Write MATLAB code to solve the following problems, and save it as a .m file. Do NOT use the Classification Learner App. Submit the .m file (a .mat file is not required) along with answers to the following questions typed as comments (%) in the .m file.

1) Generate DT1 with default options.
   a. Calculate the resubstitution error.
   b. Estimate the generalization error by 10-fold cross-validation.

2) Find an optimal value of minLeaf to avoid overfitting as follows:
   a. Set minLeaf values between 10 and 100 that are spaced exponentially apart. For each minLeaf value, estimate the generalization error by 10-fold cross-validation.
   b. Generate a plot of CV error vs. minLeaf.
   c. Find the optimal value of minLeaf (roughly) from the plot.

3) Set minLeaf to the value you found in question 2) and generate DT2.
   a. Calculate the resubstitution error.
   b. Estimate the generalization error by 10-fold cross-validation.
   c. Compare DT1 and DT2.
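The three parts of the question can be sketched in one script. This is a minimal sketch rather than a reference solution: it assumes the Statistics and Machine Learning Toolbox is installed, that "minLeaf" corresponds to the 'MinLeafSize' option of fitctree, and that ten candidate values are enough. The variable names (dt1, dt2, leafs, bestLeaf) and the fixed random seed are illustrative choices, not part of the assignment.

```matlab
% Sketch only: names such as dt1, leafs and bestLeaf are illustrative.
load ionosphere            % provides predictors X and class labels Y
rng(1);                    % assumed seed, so the CV partitions are repeatable

% 1) DT1 with default options
dt1 = fitctree(X, Y);
resubErr1 = resubLoss(dt1);                        % 1a. resubstitution error
cvErr1 = kfoldLoss(crossval(dt1, 'KFold', 10));    % 1b. 10-fold CV error

% 2a) Candidate leaf sizes from 10 to 100, exponentially spaced
leafs = round(logspace(1, 2, 10));
cvErr = zeros(size(leafs));
for k = 1:numel(leafs)
    % 'CrossVal','on' builds a 10-fold cross-validated model by default
    cvTree = fitctree(X, Y, 'CrossVal', 'on', 'MinLeafSize', leafs(k));
    cvErr(k) = kfoldLoss(cvTree);
end

% 2b) Plot CV error against the minimum leaf size
figure;
plot(leafs, cvErr, '-o');
xlabel('Minimum leaf size');
ylabel('10-fold cross-validated error');

% 2c) Take the minimum of the curve as a rough optimum
[~, idx] = min(cvErr);
bestLeaf = leafs(idx);

% 3) DT2 with the selected leaf size
dt2 = fitctree(X, Y, 'MinLeafSize', bestLeaf);
resubErr2 = resubLoss(dt2);                        % 3a. resubstitution error
cvErr2 = kfoldLoss(crossval(dt2, 'KFold', 10));    % 3b. 10-fold CV error

% 3c) Print both trees side by side for the comparison
fprintf('DT1: resub = %.4f, CV = %.4f\n', resubErr1, cvErr1);
fprintf('DT2: resub = %.4f, CV = %.4f (MinLeafSize = %d)\n', ...
        resubErr2, cvErr2, bestLeaf);
```

The CV estimates change with the random partition, so the selected leaf size can differ between runs; reading the optimum off the plot, as the question asks, is a reasonable check on the value returned by min. For the comparison in 3c, a larger minimum leaf size gives a shallower tree, so DT2 would typically be expected to show a higher resubstitution error than DT1 and a smaller gap between resubstitution and CV error. The printed numbers from your own run should be what goes into the comments.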
