Question: Here we are going to apply regression on two real world datasets with multiple X variables. The first one comes out of this popular data

 Here we are going to apply regression on two real world

Here we are going to apply regression on two real world datasets with multiple X variables. The first one comes out of this popular data repository (https://archive.ics.uci.edu/ml/datasets/Wine+Quality), and the second is an actual medical research dataset. 1. Linear Regression Apply the following steps on this wind quality dataset (here ): step 1: In excel regression, use "quality" as Y variable, and use all other columns as X variables by selecting the whole block as the input x range, click on label, output the results starting from O1, and generating NO plots. step 2: Copy the winequality-red sheet to create a duplicate worksheet, step 3: In the duplicated sheet, from the excel output, identify variables with p-value greater than 0.05, which means the corresponding variables are not predictive. (Btw, 8.37E-06 means 0.000000837). Remove those variables by deleting those colums. Also delete the excel output portion. Repeat Step 1- 3, until all variables remaining are significantly predictive, i.e., with p-value less than 0.05. This might or might not require more two iterations. a. In your final model, what factors have positive influence on quality and what factors have negative influence on quality? b. use your final model to predict the quality of this particular wine

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!