Question: Consider a simulated dataset with a binary response ` y ` and $ 5 0 0 0 $ predictors measured on $n = 5 0
Consider a simulated dataset with a binary response y and $$ predictors measured on $n $ cases, saved as a matrix x
r
set.seed
y crepc repc
x matrixNA nrow ncol
for i in :
x i rnorm
A simple classifier is applied to the simulated dataset, where the classification is performed in two steps:
Step Feature selection: Select $$ predictors with smallest $p$values from twosample $t$tests
Step Model fitting: Fit a linear discriminant analysis LDA model, using only these $$ selected predictors.
We would like to compute the $$fold crossvalidation CV estimate of test accuracy rate for the classifier.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
