CD Real Estate specializes in residential real estate services in the state of California. To complement the

Question:

CD Real Estate specializes in residential real estate services in the state of California. To complement the experience and local market knowledge of its licensed realtors, CD Real Estate wants to develop an analytical tool to predict the value of real estate. The file calireal contains data on some census tracts in California. The variables in these data are listed in Problem 20.

Predict the individuals’ credit scores using a k-nearest neighbors. Set aside 50% of the data as a test set and use 50% of the data for training and validation.

a. Based on all the input variables, determine the value of k that minimizes the RMSE in a validation procedure.

b. Experiment with different subsets of variables as input features and re-calibrate the value of k to minimize the RMSE. How does this k-nearest neighbors model compare to the model obtained in part (a)?

c. For the best-performing k-nearest neighbors model in the validation procedure, what is the RMSE on the test set?


Problem 20

CD Real Estate specializes in residential real estate services in the state of California. To complement the experience and local market knowledge of its licensed realtors, CD Real Estate wants to develop an analytical tool to predict the value of real estate. The file calireal contains data on some census tracts in California. The variables in these data are listed in the following table.image text in transcribed

Predict the median house value using an individual regression tree. Set aside 50% of the data as a test set and use 50% of the data for training and validation.

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Related Book For  book-img-for-question

Business Analytics

ISBN: 9780357902219

5th Edition

Authors: Jeffrey D. Camm, James J. Cochran, Michael J. Fry, Jeffrey W. Ohlmann

Question Posted: