Question: The following table contains a training dataset with six observations, three independent variables, and one qualitative dependent variable.(rot means red, schwarz means black) Suppose the
The following table contains a training dataset with six observations, three independent variables, and one qualitative dependent variable.(rot means red, schwarz means black)

Suppose the dataset is to be used to make a prediction for using the -nearest-neighbor method when 1 = 2 = 3 = 1. (a) (3 points) Calculate the squared Euclidean distance between each observation and the point 1 = 2 = 3 = 1. (b) (1 point) What is the prediction with = 1? Why? (c) (1 point) What is the prediction with = 4? Why? (d) (1 point) Why is a value of = 4 not recommended for this data set?
\begin{tabular}{ccccc} Nr. & X1 & X2 & X3 & Y \\ \hline 1 & 6 & 4 & 4 & Rot \\ 2 & 8 & 7 & 2 & Schwarz \\ 3 & 2 & 2 & 8 & Schwarz \\ 4 & 1 & 3 & 5 & Schwarz \\ 5 & 0 & 4 & 3 & Rot \\ 6 & 0 & 2 & 2 & Rot \end{tabular}
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
