The data in Table 2.9 is due to Anscombe (1973). The purpose of this exercise is to

Question:

The data in Table 2.9 is due to Anscombe (1973). The purpose of this exercise is to demonstrate how plotting data can reveal important information that is not evident in numerical summary statistics.

a. Compute the averages and standard deviations of each column of data. Check that the averages and standard deviations of each of the \(x\) columns are the same, within two decimal places, and similarly for each of the \(y\) columns.

b. Run four regressions, (i) \(y_{1}\) on \(x_{1}\), (ii) \(y_{2}\) on \(x_{1}\), (iii) \(y_{3}\) on \(x_{1}\), and (iv) \(y_{4}\) on \(x_{2}\). Verify, for each of the four regressions fits, that \(b_{0} \approx 3.0\), \(b_{1} \approx 0.5, s \approx 1.237\), and \(R^{2} \approx 0.677\), within two decimal places.

c. Produce scatter plots for each of the four regression models that you fit in part (b).

d. Discuss the fact that the fitted regression models produced in part (b) imply that the four datasets are similar, though the four scatter plots produced in part (c) yield a dramatically different story.

image text in transcribed


Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Question Posted: