# Question: The file P03_55.xlsx contains baseball data on all MLB teams

The file P03_55.xlsx contains baseball data on all MLB teams for the years 2004–2009. For each year and team, the total salary and the number of (regular-season) wins are listed.

a. Rearrange the data so that there are six columns: Team, Year, Salary Last Year, Salary This Year, Wins Last Year, and Wins This Year. You don’t need rows for 2004, because the 2003 data needed for Salary Last Year and Wins Last Year isn’t available. Your ending data set should have 5 × 30 = 150 rows of data.

b. Run a multiple regression for Wins This Year versus the other variables (besides Team). Then run a forward stepwise regression with these same variables. Compare the two equations, and explain exactly what the coefficients of the equation from the forward method imply about wins.

c. The Year variable should be insignificant. Is it? Why would it be contradictory for the “true” coefficient of Year to be anything other than zero?

d. Statistical inference from regression equations is all about inferring from the given data to a larger population. Does it make sense to talk about a larger population in this situation? If so, what is the larger population?
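
The rearrangement in part a can be sketched in plain Python. This is only an illustration: it assumes the data has already been read into one record per team and year, and the team names and numbers below are made-up placeholders, not values from P03_55.xlsx. The function name `add_lagged_columns` is hypothetical.

```python
# Hypothetical long-format records: (team, year, salary, wins).
# Placeholder values only; the real layout of P03_55.xlsx may differ.
records = [
    ("Team A", 2004, 50.0, 80),
    ("Team A", 2005, 55.0, 85),
    ("Team A", 2006, 60.0, 90),
    ("Team B", 2004, 90.0, 95),
    ("Team B", 2005, 95.0, 88),
    ("Team B", 2006, 99.0, 91),
]


def add_lagged_columns(records):
    """Pair each team-season with the same team's previous season."""
    by_key = {(team, year): (salary, wins) for team, year, salary, wins in records}
    rows = []
    for team, year, salary, wins in sorted(records):
        previous = by_key.get((team, year - 1))
        if previous is None:
            # The first available year has no "last year" data, so skip it.
            continue
        salary_last, wins_last = previous
        rows.append({
            "Team": team,
            "Year": year,
            "Salary Last Year": salary_last,
            "Salary This Year": salary,
            "Wins Last Year": wins_last,
            "Wins This Year": wins,
        })
    return rows


lagged = add_lagged_columns(records)
for row in lagged:
    print(row)
```

With 30 teams and six seasons each, the same logic would drop the 30 rows for 2004 and leave the 150 rows the problem asks for.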
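
For part b, a rough sketch of a full regression next to a forward selection, using NumPy. Several things here are assumptions: the data is synthetic (generated so that only last year's wins matter), the helper names `fit_ols` and `forward_select` are hypothetical, and the entry rule is a partial F-to-enter threshold of 4.0 (roughly p < 0.05 with about 150 rows), which is not necessarily the criterion your statistics package uses.

```python
import numpy as np


def fit_ols(X, y, cols):
    """Least-squares fit of y on an intercept plus the chosen columns of X."""
    design = np.column_stack([np.ones(len(y))] + [X[:, j] for j in cols])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ coef
    return coef, float(resid @ resid)


def forward_select(X, y, names, f_to_enter=4.0):
    """Add one variable at a time while the best partial F exceeds f_to_enter."""
    n = len(y)
    chosen = []
    coef, current_sse = fit_ols(X, y, chosen)  # intercept-only model
    while len(chosen) < X.shape[1]:
        best = None
        for j in range(X.shape[1]):
            if j in chosen:
                continue
            trial_coef, trial_sse = fit_ols(X, y, chosen + [j])
            df_resid = n - (len(chosen) + 2)  # intercept + predictors in trial
            f_stat = (current_sse - trial_sse) / (trial_sse / df_resid)
            if best is None or f_stat > best[0]:
                best = (f_stat, j, trial_coef, trial_sse)
        if best[0] < f_to_enter:
            break
        _, j, coef, current_sse = best
        chosen.append(j)
    return [names[j] for j in chosen], coef


# Synthetic stand-in for the 150 rearranged rows (NOT the real MLB data).
rng = np.random.default_rng(0)
n = 150
year = np.repeat(np.arange(2005, 2010), 30).astype(float)
salary_last = rng.normal(80, 30, n)
salary_this = salary_last + rng.normal(0, 10, n)
wins_last = rng.normal(81, 10, n)
wins_this = 32.4 + 0.6 * wins_last + rng.normal(0, 5, n)

names = ["Year", "Salary Last Year", "Salary This Year", "Wins Last Year"]
X = np.column_stack([year, salary_last, salary_this, wins_last])

full_coef, _ = fit_ols(X, wins_this, list(range(X.shape[1])))
selected, coef = forward_select(X, wins_this, names)
print("full model coefficients:", full_coef)
print("forward selection kept:", selected, "with coefficients", coef)
```

The returned coefficients are ordered as intercept first, then the selected variables in the order they entered, so each one can be read as the expected change in Wins This Year per unit change in that variable, holding the other selected variables fixed.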

## Relevant Questions

Do the previous problem, but use the basketball data on all NBA teams in the file P03_56.xlsx. a. Rearrange the data so that there are six columns: Team, Year, Salary Last Year, Salary This Year, Wins Last Year, and Wins ...

Dupree Fuels Company is facing a difficult problem. Dupree sells heating oil to residential customers. Given the amount of competition in the industry, both from other home heating oil suppliers and from electric and natural ...

The file P12_06.xlsx contains the weekly sales at the local outlet of West Coast Video Rentals for each of the past 36 weeks. Perform a runs test and find a few autocorrelations to determine whether this time series is ...

The file P03_30.xlsx gives monthly exchange rates (units of local currency per U.S. dollar) for nine currencies. Technical analysts believe that by charting past changes in exchange rates, it is possible to predict future ...

Consider a random walk model with the following equation: $Y_t = Y_{t-1} + 500 + e_t$, where $e_t$ is a normally distributed random series with mean 0 and standard deviation 10. a. Use Excel to simulate a time series that behaves ...