Question: (a) Construct a correlation matrix with price, acres, bedrooms, bathrooms, square feet, age, and rooms. Is there any reason to be concerned with multicollinearity based

(a) Construct a correlation matrix with price, acres, bedrooms, bathrooms, square feet, age, and rooms. Is there any reason to be concerned with multicollinearity based on the correlation matrix?

(b) Find the least-squares regression equation

yÌ… = b0 + b1x1 +b2x2 + b3x3 + b4x4 + b5x5 + b6x6, where

x1 is acres, x2 is bedrooms, and so on.

(c) Test H0: bi = 0 versus H1: at least one of the βi ≠ 0 at the a = 0.05 level of significance.

(d) Test the hypotheses H0: βi = 0 versus H1: βi ≠ 0 for i = 1, 2, . . . , 6, at the a = 0.05 level of significance.

(e) Examine your regression results and remove any explanatory variable whose coefficient is not significantly different from 0 to obtain the model of best fit.

(f) Once you have obtained your model of best fit, draw residual plots and a boxplot of the residuals to assess the adequacy of the model.

(g) Determine and interpret R2 and adjusted R2. How well does your model appear to fit the data?

(h) Use your model to predict the selling price for another house from the agent's territory that has the following characteristics: 0.18 acre, 3 bedrooms, 1 bath, 1176 square feet, 47 years old, and 6 total rooms. Compare your prediction to the actual selling price: $99,900. Location, location, location! The location of the house can have a large effect on its selling price. The first 12 houses listed are from the same zip code, the next 10 are from a second zip code, and the last 6 are from a third zip code.

(i) Construct side-by-side boxplots of selling price for the three zip codes. Is there any reason to believe that selling prices vary from one zip code to the next within the agent's territory?

(j) Introduce dummy explanatory variables to represent zip code and repeat parts (b)-(g) to find the model of best fit for price.

(k) Repeat part (h) assuming the house comes from the first zip code. Which model did a better job predicting the selling price?

(l) Explain the limitations of this model. Which, if any, can be dealt with, and how would you do so?

Price ($1000s) Acres Bedrooms Bathrooms Sq. ft. Age (yr) Rooms 104.9 0.19 3 1.0 900 44 8 109.0 0.15 3 2.0 1431 34 6. 94.9 0.20 3 1.5 1064 49 6 96.5 0.18 1.0 780 52 4 127.9 0.17 3 2.5 1140 47 6. 129.9 0.18 3 1.5 1140 41

During the early 2000s, the United States experienced a boom in the housing industry, in large part due to efforts by the government to boost consumer spending. For many, the lure of low interest rates put them in the market for a house.
When house shopping, a natural question is "How much is the house worth?" This is difficult to answer because it depends on what the market will bear-the house is worth what someone else is willing to pay for it.
A real estate agent wishes to examine several recent house sales in his territory and develop a model that could be used to give a rough idea of a house's fair market value. Articles on how to determine the value of a house often suggest comparing square footage, number of bedrooms, number of bathrooms, and size of the lot. The agent decided to examine these four variables, along with the age of the house and number of rooms, in an effort to predict the house's selling price.

Price ($1000s) Acres Bedrooms Bathrooms Sq. ft. Age (yr) Rooms 104.9 0.19 3 1.0 900 44 8 109.0 0.15 3 2.0 1431 34 6. 94.9 0.20 3 1.5 1064 49 6 96.5 0.18 1.0 780 52 4 127.9 0.17 3 2.5 1140 47 6. 129.9 0.18 3 1.5 1140 41 7 145.0 0.18 3 1.5 1845 45 6 199.9 0.17 4 3.5 1974 17 8. 255.9 0.24 3.0 2460 22 8 310.0 0.23 4 3.5 2490 14 9. 169.0 0.20 4 2.0 1896 37 8 344.5 0.24 4 4.5 2709 11 8 123.0 0.13 3 1.0 828 63 5 139.9 0.16 2.0 1131 75 169.9 0.15 2 2.0 1002 96 7 194.9 0.19 3. 1.0 1024 55 6. 210.0 0.23 3 1.0 1694 53 9 275.0 0.17 4 2.0 2380 10 8. 299.5 0.17 4 2.0 1936 97 8 319.9 0.27 3. 2.0 1648 77 8 397.5 0.30 4 2.5 2500 106 10 189.9 0.18 1.0 1016 71 6 349.9 0.40 4 2.5 1816 70 8 454.9 0.96 3. 3.0 2160 37 7 499.9 1.00 3.0 3104 48 10 615.0 0.66 4 3.5 3205 26 10 635.0 0.44 4 3.5 3084 27 10 929.0 0.90 4.5 4470 16 14

Step by Step Solution

3.45 Rating (171 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

a b c at least one of the The test statistic is The P value is so reject the null hypothesis and conclude that at least one of the explanatory variables is linearly related to price d The test statist... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Document Format (1 attachment)

Word file Icon

1382-M-S-H-T(6612).docx

120 KBs Word File

Students Have Also Explored These Related Statistics Questions!