Question: Problem 1. (40 points)This exercise consists of a complete set of data analysis steps using linear regression modeling implemented on the Cars93 dataset. The main
Problem 1. (40 points)This exercise consists of a complete set of data analysis steps using linear regression modeling implemented on the Cars93 dataset. The main problem of interest is to find the predictors for fuel efficiency of cars. This dataset is a part of the DAAG package that needs to be installed using the command, install.packages(DAAG) in R and loaded by the command, library(DAAG).
1)(5 points) Create the variable of interest by averaging the MPG city and MPG highway.
2)(5 points) In this step we start the model-building method of forward selection/ backward elimination of covariates. Begin fitting the models one covariate at the time (forward selection).
3)(5 points) Begin building the multivariate model by keeping the most significant covariate from step 2 and adding a second covariate one at a time (forward selection).
4)(5 points) If the covariate from step 3 attenuates the significance of the initially selected covariate from step 2, the latter covariate is removed from the model (backward elimination). If not, keep both variables in the model and continue with next.
5)(5 points) Repeat steps 2, 3 and 4 until the set of variables is exhausted.
6)(5 points) By now we have all the significant main effects into the model. Begin adding all meaningful 2-way interactions (there are many possible ones).
7)(5 points) Use the R-squared, adjusted R-squared statistics and all graphing diagnostic/goodness-of-fit tools to assess the appropriateness of the selected model.
8)(5 points) Write a hypothetical methods and results sections of a paper titled Predictors of the fuel efficiency of cars.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
