
3. The Sunday April 15, 2007 issue of the Houston Chronicle included a section devoted to real estate prices in Houston. In particular, data are presented on the 2006 median price per square foot for 1922 subdivisions. The data (HoustonRealEstate.txt) can be found on the book web site. Interest centers on developing a regression model to predict

$Y_i$ = 2006 median price per square foot

from

$x_{1i}$ = %NewHomes (i.e., of the houses that sold in 2006, the percentage that were built in 2005 or 2006)

$x_{2i}$ = %Foreclosures (i.e., of the houses that sold in 2006, the percentage that were identified as foreclosures)

for the $i = 1, \ldots, 1922$ subdivisions. The first model considered was

$$Y_i = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + e_i \tag{4.6}$$

Model (4.6) was fit using weighted least squares with weights $w_i = n_i$, where $n_i$ = the number of homes sold in subdivision $i$ in 2006. Output from model (4.6), in the form of plots, appears in Figure 4.1.

(a) Explain why it is necessary to use weighted least squares to fit model (4.6) and why $w_i = n_i$ is the appropriate choice for the weights.

(b) Explain why (4.6) is not a valid regression model.

(c) Describe what steps you would take to obtain a valid regression model (Figure 4.1).
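For readers who want to see the mechanics of the fit, the weighted least squares step for a model of the form (4.6) can be sketched as follows. This is a minimal illustration and not the book's own code: the function name `weighted_least_squares`, the variable names, and the data are all hypothetical. The data are synthetic (they are not the contents of HoustonRealEstate.txt) and are generated under the assumption that the error variance in subdivision $i$ is proportional to $1/n_i$.

```python
import numpy as np


def weighted_least_squares(X, y, w):
    """Return the coefficients minimising sum_i w_i * (y_i - b0 - x_i'b)^2.

    X: (m, p) predictors, y: (m,) response, w: (m,) positive weights.
    The first returned coefficient is the intercept.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    design = np.column_stack([np.ones(len(y)), X])
    sqrt_w = np.sqrt(w)
    # Scaling each row by sqrt(w_i) turns the weighted problem into an
    # ordinary least squares problem on the rescaled data.
    coef, *_ = np.linalg.lstsq(design * sqrt_w[:, None], y * sqrt_w, rcond=None)
    return coef


# Synthetic illustration only (assumed values, NOT the Houston data).
rng = np.random.default_rng(0)
m = 200                                       # number of made-up subdivisions
n_sold = rng.integers(5, 200, size=m)         # n_i: homes sold per subdivision
pct_new = rng.uniform(0, 100, size=m)         # stands in for x_1i
pct_foreclosure = rng.uniform(0, 50, size=m)  # stands in for x_2i
# Assumption: the error standard deviation shrinks like 1 / sqrt(n_i).
noise = rng.normal(0.0, 20.0 / np.sqrt(n_sold))
price = 80.0 + 0.3 * pct_new - 0.5 * pct_foreclosure + noise

predictors = np.column_stack([pct_new, pct_foreclosure])
coef = weighted_least_squares(predictors, price, w=n_sold)
print(coef)  # estimates of (beta_0, beta_1, beta_2) for the synthetic data
```

Rescaling by $\sqrt{w_i}$ is one common way to compute the weighted fit; a dedicated routine (for example the `weights` argument of `lm` in R) should give the same coefficient estimates.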

