# Question: In his best selling book Moneyball author Michael Lewis discusses how

In his best-selling book Moneyball, author Michael Lewis discusses how statistics can be used to judge both a baseball player’s potential and a team’s ability to win games. One aspect of this analysis is that a team’s on-base percentage is the best predictor of winning percentage. The on-base percentage is the proportion of time a player reaches a base. For example, an on-base percentage of 0.3 would mean the player safely reaches bases 3 times out of 10, on average. For the 2010 baseball season, winning percentage, y, and on-base percentage, x,

are linearly related by the least-squares regression equation y-hat = 3.4722x - 0.6294.

(a) Interpret the slope.

(b) For 2010, the lowest on-base percentage was 0.298 and the highest on-base percentage was 0.350. Use this information to explain why it does not make sense to interpret the y-intercept.

(c) Would it be a good idea to use this model to predict the winning percentage of a team whose on-base percentage was 0.250? Why or why not?

(d) The 2010 World Series Champion San Francisco Giants had an on-base percentage of 0.321 and a winning percentage of 0.568. What is the residual for San Francisco? How would you interpret this residual?

are linearly related by the least-squares regression equation y-hat = 3.4722x - 0.6294.

(a) Interpret the slope.

(b) For 2010, the lowest on-base percentage was 0.298 and the highest on-base percentage was 0.350. Use this information to explain why it does not make sense to interpret the y-intercept.

(c) Would it be a good idea to use this model to predict the winning percentage of a team whose on-base percentage was 0.250? Why or why not?

(d) The 2010 World Series Champion San Francisco Giants had an on-base percentage of 0.321 and a winning percentage of 0.568. What is the residual for San Francisco? How would you interpret this residual?

## Answer to relevant Questions

The least-squares regression equation y-hat = 0.7676x - 52.6841 relates the carbon dioxide emissions (in hundred thousands of tons), y, and energy produced (hundred thousands of megawatts), x, for all countries in the world ...An engineer wants to determine how the weight of a car, x, affects gas mileage, y. The following data represent the weights of various domestic cars and their miles per gallon in the city for the 2011 model year. (a) Find ...One of the biggest factors in determining the value of a home is the square footage. The following data represent the square footage and asking price (in thousands of dollars) for a random sample of homes for sale in Naples, ...Use the results from Problem 28 in Section 4.1 and Problem 20 in Section 4.2 to: (a) Compute the coefﬁcient of determination, R2. (b) Interpret the coefﬁcient of determination and comment on the adequacy of the linear ...(a) Construct a frequency marginal distribution. (b) Construct a relative frequency marginal distribution. (c) Construct a conditional distribution by x. (d) Draw a bar graph of the conditional distribution found in part ...Post your question