Question: 2. For question 2, import the data from https ://raw . githubusercontent . com/hgweon2/data/main/hw3-data. txt The imported dataset contains 200 observations with 3 variables: y,


2. For question 2, import the data from https ://raw . githubusercontent . com/hgweon2/data/main/hw3-data. txt The imported dataset contains 200 observations with 3 variables: y, x1 and x2. (a) Plot a scatterplot matrix and briefly discuss the relationships between the variables. (b) Obtain the fitted model: Y = Bo + 3141 + 3242. Check the model assumptions using appropriate graphical and testings approaches. (c) Was there any influential point? Use Cook's distance with threshold = 4. Report the indices of the influential points. (d) Among the influential points, how many of them are also considered outliers (whose absolute standardized residuals are greater than 2)? (e) Suppose that the influential points identified in (c) were simple measurement errors. Remove the influential points from the data and repeat (b) using the updated data set. Was the removal of the influential points helpful for correcting the model assumptions
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
