# Question: Collinearity is sometimes described as a problem with the data

Collinearity is sometimes described as a problem with the data, not the model. Rather than filling the scatterplot of X1 on X2, the data concentrate along a diagonal. For example, the following plot shows monthly percentage changes in the whole stock market and the S&P 500 (in excess of the risk-free rate of return). The data span the same period considered in the text, running monthly from 1995 through 2011.

(a) Data for two months (February and March of 2000, identified as × in the plot) deviate from the pattern evident in other months. What makes these months unusual?

(b) If you were to use both returns on the market and those on the S&P 500 as explanatory variables in the same regression, would these two months be leveraged?

(c) Would you want to use these months in the regression or exclude them from the multiple regression?
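
Part (b) turns on how leverage behaves when two explanatory variables are nearly collinear. The sketch below illustrates this with synthetic data, not the actual market and S&P 500 returns. The sample size, the standard deviations, and the two off-diagonal points are all assumptions made for illustration. It computes leverage as the diagonal of the hat matrix for a regression with an intercept and both explanatory variables.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed; all data below are synthetic

# Synthetic stand-ins for two highly correlated excess returns (percent).
n = 200
x1 = rng.normal(0.0, 4.5, size=n)
x2 = x1 + rng.normal(0.0, 0.3, size=n)  # x2 tracks x1 closely

# Append two hypothetical points that sit off the diagonal.
x1 = np.append(x1, [2.0, -1.0])
x2 = np.append(x2, [-1.0, 2.0])

# Design matrix with an intercept column and both explanatory variables.
X = np.column_stack([np.ones_like(x1), x1, x2])

# Leverages are the diagonal of the hat matrix H = X (X'X)^{-1} X'.
# With a reduced QR factorization X = QR, H = QQ', so h_i = sum_j Q[i, j]**2.
Q, _ = np.linalg.qr(X)
leverage = np.sum(Q**2, axis=1)

# Compare the two largest leverages with the average leverage.
top_two = np.argsort(leverage)[-2:]
print("indices with largest leverage:", sorted(top_two.tolist()))
print("their leverages:", leverage[top_two])
print("average leverage:", leverage.mean())
```

In this synthetic setting, the two appended points are unremarkable in either variable taken alone, yet they lie far from the diagonal along which the rest of the data concentrate. That distance from the joint pattern is what gives them high leverage in the multiple regression.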

## Answer to relevant Questions

- Regression models that describe macroeconomic properties in the United States often have to deal with large amounts of collinearity. For example, suppose we want to use as explanatory variables the disposable income and the ...
- Modern steel mills are very automated and need to monitor their substantial energy costs carefully to be competitive. In making cold-rolled steel (as used in bodies of cars), it is known that temperature during rolling and ...
- This data table gives annual costs of 223 commercial leases. All of these leases provide office space in a Midwestern city in the United States. The cost of the lease is measured in dollars per square foot, per year. The ...
- 1. Intercept for high school graduate 2. Intercept for college graduate 3. Has units $thousand/year, high school graduate 4. Has units $thousand/year, college graduate 5. Difference in slopes 6. Difference in intercepts 7. ...
- A two-sample t-test has a lot in common with simple regression. This output summarizes the results of fitting a simple regression with only a dummy variable as the explanatory variable. The data are the same salary data used ...