Question: Part 2: Find a data set that is useful for multiple linear regression. You may use one of the data sets available under Project Information

 Part 2: Find a data set that is useful for multiple

Part 2: Find a data set that is useful for multiple linear regression. You may use one of the data sets available under "Project Information" in eLearning, find your own data set online, or use one from your studies in Biology, Environmental Science, Economics, etc. Your data set needs to include at least 6 variables (including dependent and independent). Develop a regression model using the following steps: * Check for outliers and influence points (eliminate points if needed) * Check for multi-collinearity and note any problem interactions. Be sure these are not included in your final models. Pick several candidate models that are simple, have high adjusted R2, have low Cp, and have low MSE. (Watch for multi-collinearity). Check PRESS for your chosen models to pick a final model. . * Write up your results. Create a table for your candidate models including: model, adjusted R2 MSE, Cp, PRESS, Highest VIF in the model. Choose your best fit model. Write a brief analysis on why you chose this model. Include your SAS code as an appendix. Do not include many, many pages of SAS output. Helpful SAS code: data regression; input x1-xn y@@; cards; [data] prog reg; model y x1-xn/influence; run; proc reg; model y-x1-xn/vif; run; proc rsquare adjrsq mse cp; model y x1-xn; run proc reg; model y x1-xn/cli clm p; run; Part 2: Find a data set that is useful for multiple linear regression. You may use one of the data sets available under "Project Information" in eLearning, find your own data set online, or use one from your studies in Biology, Environmental Science, Economics, etc. Your data set needs to include at least 6 variables (including dependent and independent). Develop a regression model using the following steps: * Check for outliers and influence points (eliminate points if needed) * Check for multi-collinearity and note any problem interactions. Be sure these are not included in your final models. Pick several candidate models that are simple, have high adjusted R2, have low Cp, and have low MSE. (Watch for multi-collinearity). Check PRESS for your chosen models to pick a final model. . * Write up your results. Create a table for your candidate models including: model, adjusted R2 MSE, Cp, PRESS, Highest VIF in the model. Choose your best fit model. Write a brief analysis on why you chose this model. Include your SAS code as an appendix. Do not include many, many pages of SAS output. Helpful SAS code: data regression; input x1-xn y@@; cards; [data] prog reg; model y x1-xn/influence; run; proc reg; model y-x1-xn/vif; run; proc rsquare adjrsq mse cp; model y x1-xn; run proc reg; model y x1-xn/cli clm p; run

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!