Question: 3 . This question should be answered using the Ionosphere data set, which is part of the mlbench package. This radar data was collected by
This question should be answered using the Ionosphere data set, which is part of the mlbench package. This radar data was collected by a system in Goose Bay, Labrador. The data frame consists of observations on independent variables. The last column in the dataframe is a categorical variable, Class defining the free electrons in the ionosphere: good radar returns are those showing evidence of some type of structure in the ionosphere. bad returns are those that do not.
a Produce some numerical and graphical summaries of the Ionosphere data. Do there appear to be any patterns?
b Notice that the second the column contains only one single value, so remove that column and work on the rest of the questions using the new dataset. Perform a KNearest Neighbors KNN algorithm with K where Class is the response, and the rest columns in the dataset as predictors.
c Compute the confusion matrix and overall fraction of correct predictions. Explain what the confusion matrix is telling you about the types of mistakes made by KNN algorithm.
d Split the data randomly into a training set and a test set Make sure to use set.seed for reproducible results. Fit the KNN model K
e Repeat d using K
f Repeat d using K
g Which of these methods appears to provide the best results on this data? Please answer and mark abcdefg for clarity
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
