# Question: Refer to the Baseball 2012 data which report information on

Refer to the Baseball 2012 data, which report information on the 30 Major League Baseball teams for the 2012 season. Let the number of games won be the dependent variable and the following variables be independent variables: team batting average, number of stolen bases, number of errors committed, team ERA, number of home runs, and whether the team plays in the American or the National League.

a. Use a statistical software package to determine the multiple regression equation. Discuss each of the variables. For example, are you surprised that the regression coefficient for ERA is negative? Is the number of wins affected by whether the team plays in the National or the American League?

b. Find the coefficient of determination for this set of independent variables.

c. Develop a correlation matrix. Which independent variables have strong or weak correlations with the dependent variable? Do you see any problems with multicollinearity?

d. Conduct a global test on the set of independent variables. Interpret.

e. Conduct a test of hypothesis on each of the independent variables. Would you consider deleting any of the variables? If so, which ones?

f. Rerun the analysis until only significant regression coefficients remain in the analysis. Identify these variables.

g. Develop a histogram or a stem-and-leaf display of the residuals from the final regression equation developed in part (f). Is it reasonable to conclude that the normality assumption has been met?

h. Plot the residuals against the fitted values from the final regression equation developed in part (f). Plot the residuals on the vertical axis and the fitted values on the horizontal axis.
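
As a rough illustration of parts (a) through (d), here is a minimal sketch in plain Python. It does not use the Baseball 2012 data: the 30 rows are synthetic, and the column names (`avg`, `sb`, `errors`, `era`, `hr`, `league`) and the coefficients used to generate wins are assumptions made only for the demonstration. It fits the regression by solving the normal equations directly. A statistical package would do the same fit and would also report standard errors and p-values.

```python
import random
import statistics

def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_ols(rows, y):
    """Least-squares fit with an intercept; returns coefficients, R^2, F, fitted values, residuals."""
    X = [[1.0] + list(r) for r in rows]
    p = len(X[0])
    xtx = [[sum(x[i] * x[j] for x in X) for j in range(p)] for i in range(p)]
    xty = [sum(x[i] * yi for x, yi in zip(X, y)) for i in range(p)]
    beta = solve(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, x)) for x in X]
    resid = [yi - fi for yi, fi in zip(y, fitted)]
    ybar = statistics.fmean(y)
    sse = sum(e * e for e in resid)
    sst = sum((yi - ybar) ** 2 for yi in y)
    n, k = len(y), p - 1
    r2 = 1 - sse / sst                              # coefficient of determination, part (b)
    f_stat = ((sst - sse) / k) / (sse / (n - k - 1))  # global F statistic, part (d)
    return beta, r2, f_stat, fitted, resid

def pearson(u, v):
    """Pearson correlation between two equal-length lists."""
    mu, mv = statistics.fmean(u), statistics.fmean(v)
    num = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    den = (sum((a - mu) ** 2 for a in u) * sum((b - mv) ** 2 for b in v)) ** 0.5
    return num / den

# Synthetic stand-in for the 30 teams (NOT the real 2012 figures).
random.seed(2012)
names = ["avg", "sb", "errors", "era", "hr", "league"]
rows, wins = [], []
for i in range(30):
    avg = random.gauss(0.255, 0.010)
    sb = random.gauss(105, 25)
    errors = random.gauss(100, 12)
    era = random.gauss(4.0, 0.45)
    hr = random.gauss(160, 30)
    league = float(i % 2)  # hypothetical coding: 1 = American, 0 = National
    # Assumed generating model: wins rise with avg and hr, fall with era.
    w = 81 + 400 * (avg - 0.255) - 15 * (era - 4.0) + 0.10 * (hr - 160) + random.gauss(0, 3)
    rows.append([avg, sb, errors, era, hr, league])
    wins.append(w)

beta, r2, f_stat, fitted, resid = fit_ols(rows, wins)

print("intercept", round(beta[0], 3))
for name, b in zip(names, beta[1:]):
    print(name, round(b, 3))
print("R^2", round(r2, 3), "F", round(f_stat, 2))

# Correlation of each predictor with wins, part (c).
for j, name in enumerate(names):
    print("corr(wins,", name + ")", round(pearson([r[j] for r in rows], wins), 3))
```

With 30 teams and 6 predictors, the F statistic is compared against the F distribution with 6 and 23 degrees of freedom. The `fitted` and `resid` lists are the inputs for the histogram in part (g) and the residual plot in part (h).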
