# Question

Pedro Martinez, who retired from Major League Baseball in 2012, had a stellar career, helping the Boston Red Sox to their first World Series title in 86 years in 2004. The next year he became a free agent and the New York Mets picked him up for $53 million for 4 years. Even after the move to New York, Martinez had his own fans. Possibly, he attracted more fans to the ballpark when he pitched at home, helping to justify his multimillion dollar contract. Was there really a “Pedro effect” in attendance? We have data for the Mets home games of the 2005 season. The regression has the following predictors:

Weekend 1 if game is on a weekend day or night, 0 otherwise

Yankees 1 if game is against the Yankees (a hometown rivalry), 0 otherwise

Rain Delay 1 if the game was delayed by rain (which might have depressed attendance), 0 otherwise

Opening Day 1 for opening day, 0 for the others

Pedro Start 1 if Pedro was the starting pitcher, 0 otherwise Here’s the regression.

a) All of these predictors are of a special kind. What are they called?

b) What is the interpretation of the coefficient for Pedro Start?

c) If we’re primarily interested in Pedro’s effect on attendance, why is it important to have the other variables in the model?

d) Could Pedro’s agent claim, based on this regression, that his man attracts more fans to the ballpark? What statistics should he cite?

Weekend 1 if game is on a weekend day or night, 0 otherwise

Yankees 1 if game is against the Yankees (a hometown rivalry), 0 otherwise

Rain Delay 1 if the game was delayed by rain (which might have depressed attendance), 0 otherwise

Opening Day 1 for opening day, 0 for the others

Pedro Start 1 if Pedro was the starting pitcher, 0 otherwise Here’s the regression.

a) All of these predictors are of a special kind. What are they called?

b) What is the interpretation of the coefficient for Pedro Start?

c) If we’re primarily interested in Pedro’s effect on attendance, why is it important to have the other variables in the model?

d) Could Pedro’s agent claim, based on this regression, that his man attracts more fans to the ballpark? What statistics should he cite?

## Answer to relevant Questions

In Exercise 19, we raised questions about two gourmet pizzas. After removing them, the resulting regression looks like this. A plot of the residuals against the predicted values for this regression looks like this. It has ...Cost of Living 2013, revisited. For the first model considered in Exercise 27, with all four predictors in the model, a plot of Leverage values shows the two largest values are Hong Kong (0.18) and Luanda, Angola (0.15). a) ...In Exercise we saw that there were several potential high influence points. After a researcher set aside those four countries, she refit the model in Exercise 33. A plot of residuals vs. predicted values showed: a) What ...Are the following data time series? If not, explain why. a) Quarterly earnings of Microsoft Corp. b) Unemployment in August 2010 by Education level. c) Time spent in training by workers in NewCo. d) Numbers of e- mails ...For each of the following time series, suggest an appropriate model: a) Daily stock prices that reveal erratic periods of up and down swings. b) Monthly sales that reveal a consistent percentage increase from month to ...Post your question

0