Question: Name: 3. (5 = 2+3 points ) Logistic regression. Suppose you are given a dataset consisting of the gene expression data for 500 genes and

Name: 3. (5 = 2+3 points ) Logistic regression. Suppose you are given a dataset consisting of the gene expression data for 500 genes and the binary values for 50 symptoms for each of 300 patients and tasked with selecting a subset of genes for a linear regression model using either forward selection or backward elimination. (a) Which method should you use, and why? (b) Is that method guaranteed to give the best possible model? Why or why not
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
