# Question

Using software, analyze the relationship between x = college education and y = percentage single-parent families, for the data in Table 3.6, which are in the U.S. Statewide Crime data file on the text CD.

a. Construct a scatterplot. Based on your plot, identify two observations that seem quite different from the others, having a y value relatively large in one case and somewhat small in the other case.

b. Find the regression equation (i) for the entire data set, (ii) deleting only the first of the two outlying observations, and (iii) deleting only the second of the two outlying observations.

c. Is either deleted observation influential on the slope? Summarize the influence.

d. Including D.C., ŷ = 21.2 + 0.089x, whereas deleting that observation, ŷ = 28.1 - 0.206x. Find ŷ for D.C. in the two cases. Does the predicted value for D.C. depend much on which regression equation is used?
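Parts (b) through (d) all come down to fitting a least-squares line with and without one observation and comparing the results. The following is a minimal sketch of that workflow in plain Python, not the textbook's own software output. The helper names (`fit_line`, `fit_without`, `dc_predictions`) are hypothetical, and the small data set is made up for illustration; it is not the Table 3.6 data, which must be loaded from the U.S. Statewide Crime data file. Only the two equations in `dc_predictions` come from part (d).

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def fit_without(xs, ys, index):
    """Refit the line after deleting the observation at position `index`."""
    keep = [i for i in range(len(xs)) if i != index]
    return fit_line([xs[i] for i in keep], [ys[i] for i in keep])


def predict(intercept, slope, x):
    """Predicted value y-hat = intercept + slope * x."""
    return intercept + slope * x


def dc_predictions(x_dc):
    """Apply the two equations quoted in part (d) to a given x for D.C.

    x_dc must be read from Table 3.6; it is not stated in the exercise text.
    """
    with_dc = predict(21.2, 0.089, x_dc)      # equation fitted including D.C.
    without_dc = predict(28.1, -0.206, x_dc)  # equation fitted excluding D.C.
    return with_dc, without_dc


# Made-up illustrative data (NOT Table 3.6): the last point is an outlier
# with a large x and a small y.
xs = [1.0, 2.0, 3.0, 4.0, 10.0]
ys = [2.0, 4.0, 6.0, 8.0, 0.0]

full_intercept, full_slope = fit_line(xs, ys)
reduced_intercept, reduced_slope = fit_without(xs, ys, 4)

print("all points:      slope =", round(full_slope, 3))
print("outlier removed: slope =", round(reduced_slope, 3))
```

In this toy data, deleting the single outlying point changes the slope from negative to positive, which is the kind of change part (c) asks about. For part (d), calling `dc_predictions` with D.C.'s college education value from the table gives the two predicted values to compare.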
