# Question

Breakfast cereal manufacturers publish nutrition information on each box of their product. As we saw in Chapter 16, there is a long history of cereals being associated with nutrition. Here’s a regression to predict the number of Calories in breakfast cereals from their Sodium, Potassium, and Sugar content, and some diagnostic plots.

The shaded part of the histogram corresponds to the two cereals plotted with x’s in the normal probability plot of the leverages and the residuals plot. These are All-Bran with Extra Fiber and All-Bran.

a) What do the displays say about the influence of these two cereals on this regression? (The histogram is of the Studentized residuals.)

Here’s another regression with dummy variables defined for each of the two bran cereals.

b) Explain what the coefficients of the bran cereal dummy variables mean.

c) Which regression would you select for understanding the interplay of these nutrition components. Explain.

d) As you can see from the scatterplot, there’s another cereal with high potassium. Not too surprisingly, it is 100% Bran. But it does not have leverage as high as the other two bran cereals. Do you think it should be treated like them (i.e., removed from the model, fit with its own dummy, or left in the model with no special attention, depending on your answer to part c)? Explain.

The shaded part of the histogram corresponds to the two cereals plotted with x’s in the normal probability plot of the leverages and the residuals plot. These are All-Bran with Extra Fiber and All-Bran.

a) What do the displays say about the influence of these two cereals on this regression? (The histogram is of the Studentized residuals.)

Here’s another regression with dummy variables defined for each of the two bran cereals.

b) Explain what the coefficients of the bran cereal dummy variables mean.

c) Which regression would you select for understanding the interplay of these nutrition components. Explain.

d) As you can see from the scatterplot, there’s another cereal with high potassium. Not too surprisingly, it is 100% Bran. But it does not have leverage as high as the other two bran cereals. Do you think it should be treated like them (i.e., removed from the model, fit with its own dummy, or left in the model with no special attention, depending on your answer to part c)? Explain.

## Answer to relevant Questions

The Brief Case in Chapter 4 introduced the Cost of Living dataset that contains an estimate of the cost of living for 322 cities worldwide in 2013. In addition to the overall Cost of Living Index are: the Rent Index, ...In Exercise we found a model for the gross revenue from U.S. movie theatres for 106 recent movies that were rated either R or PG-13. A plot of residuals against predicted revenue shows: A histogram of the y variable, US ...For each of the following, show how you would code dummy (indicator) variables to include in a regression model. a) Type of residence (Apartment, Condominium, Townhouse, Single family home) b) Employment status (Full-time, ...An Additive regression model for the Apple prices is: a) What is the name for the kind of variable called Jan in this model? b) Why is there no predictor variable for December? The price of bananas fluctuates on the world market. Here are the prices ($/tonne) for the years 2000–2004. a) Find a 3- year moving average prediction for the price in 2005. b) Find a prediction for 2005 with an ...Post your question

0