Question: Exercise 3 (Breast Cancer Detection) For this exercise we will use data found in wisc-train. csv and wisc-test. csv, which contain train and test data,

 Exercise 3 (Breast Cancer Detection) For this exercise we will use

Exercise 3 (Breast Cancer Detection) For this exercise we will use data found in wisc-train. csv and wisc-test. csv, which contain train and test data, respectively. wisc. csv is provided but not used. This is a modification of the Breast Cancer Wisconsin (Diagnostic) dataset from the UCI Machine Learning Repository. Only the first 10 feature variables have been provided. (And these are all you should use.) . UCI Page . Data Detail You should consider coercing the response to be a factor variable if it is not stored as one after importing the data. (a) The response variable class has two levels: M if a tumor is malignant, and B if a tumor is benign. Fit three models to the training data. . An additive model that uses radius, smoothness, and texture as predictors . An additive model that uses all available predictors . A model chosen via backwards selection using AIC. Use a model that considers all available predictors as well as their two-way interactions for the start of the search. For each, obtain a 5-fold cross-validated misclassification rate using the model as a classifier that seeks to minimize the misclassification rate. Based on this, which model is best? Relative to the best, are the other two underfitting or over fitting? Report the test misclassification rate for the model you picked as the best

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!