Question: The image below shows a scatterplot for a dataset with 6 people (A,B,C,D,E, and F). Each person has a score for two predictor variables and


The image below shows a scatterplot for a dataset with 6 people (A,B,C,D,E, and F). Each person has a score for two predictor variables and an outcome variable (Win vs. Lose).

Imagine that you performed a 6-fold cross-validation on the data. Fill in the 6 tables that would be created from the 6-fold cross-validation procedure. The first table (Fold 1) is already completed for you. Work in reverse alphabetical order, where the First fold’s test set has person F, the 2nd Fold’s test set has person E, the 3rd Fold’s test set has person D.


Fold 1:

Train Dataset

Participant

Outcome

Predictor 1

Predictor 2

A

Win

0

0

B

Win

0

2

C

Win

2

0

D

Win

2

2

E

Lose

1

1

Test Dataset

Participant

Outcome

Predictor 1

Predictor 2

F

Win

1

2


Fold 2:

Train Dataset

Participant

Outcome

Predictor 1

Predictor 2

A

win

0

0

B

win

0

2

C

win

2

0

D

win

2

2

F

win

1

2

Test Dataset

Participant

Outcome

Predictor 1

Predictor 2

E

lose

1

1


Fold 3:

Train Dataset

Participant

Outcome

Predictor 1

Predictor 2

A

win

0

0

B

win

0

2

C

win

2

0

E

lose

1

1

F

win

1

2


Test Dataset

Participant

Outcome

Predictor 1

Predictor 2

D

win

2

2


Fold 4:

Train Dataset

Participant

Outcome

Predictor 1

Predictor 2






















Test Dataset

Participant

Outcome

Predictor 1

Predictor 2






Fold 5:

Train Dataset

Participant

Outcome

Predictor 1

Predictor 2





















Test Dataset

Participant

Outcome

Predictor 1

Predictor 2






Fold 6:

Train Dataset

Participant

Outcome

Predictor 1

Predictor 2






















Test Dataset

Participant

Outcome

Predictor 1

Predictor 2





--------------------------------------------------------------------------------------------------------

If you used a 3-Nearest Neighbor Model to predict the test outcome from the training dataset, what is your error (accuracy) on the test set for….

*Hint: You can look at the image below to figure out the 3-nearest neighbors for the case you are predicting in each test set

Test Accuracy for Fold 1_________________

Test Accuracy for Fold 2________________

Test Accuracy for Fold 3________________

Test Accuracy for Fold 4________________

Test Accuracy for Fold 5________________

Test Accuracy for Fold 6________________

Based on the errors above, what is the average cross-validated accuracy when using a 3-Nearest neighbor Model__________

Create Tables for a 2-Fold Cross validation using the same image as before. You can choose which participants are assigned to go into the first fold’s training and test set randomly.

Step by Step Solution

3.45 Rating (148 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

Based on the provided information here are the completed tables for the 6fold crossvalidation and the calculation of average crossvalidated accuracy u... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!