# Question

Refer to the scenario described in Problem 10 and the file BlueOrRed. Partition the data into training (50 percent), validation (30 percent), and test (20 percent) sets. Use logistic regression to classify observations as undecided (or decided) using Age, HomeOwner, Female, Married, HouseholdSize, Income, and Education as input variables and Undecided as the output variable. Perform an exhaustive-search best subset selection with the number of subsets equal to 2.

a. From the generated set of logistic regression models, select one that you believe is a good fit. Express the model as a mathematical equation relating the output variable to the input variables.

b. Increases in which variables increase the chance of a voter being undecided? Increases in which variables decrease the chance of a voter being decided?

c. Using the default cutoff value of 0.5 for your logistic regression model, what is the overall error rate on the test data?

d. Examine the decile-wise lift chart for your model on the test data. What is the first decile lift? Interpret this value.
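
For orientation, the quantities named in the problem statement and in parts c and d can be sketched in plain Python. This is a minimal sketch, not the textbook's software procedure: the function names `partition`, `overall_error_rate`, and `first_decile_lift` are hypothetical, the sketch assumes Undecided is coded 1 for undecided and 0 for decided, and it assumes an observation is classified as undecided when its predicted probability is at least the cutoff. It takes predicted probabilities as given and does not fit the logistic regression or perform the best subset search.

```python
import random


def partition(rows, fractions=(0.5, 0.3, 0.2), seed=0):
    """Shuffle rows and split them into training, validation, and test sets."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_train = int(round(fractions[0] * len(shuffled)))
    n_valid = int(round(fractions[1] * len(shuffled)))
    train = shuffled[:n_train]
    valid = shuffled[n_train:n_train + n_valid]
    test = shuffled[n_train + n_valid:]  # whatever remains (about 20 percent)
    return train, valid, test


def overall_error_rate(actual, predicted_prob, cutoff=0.5):
    """Fraction of observations misclassified at the given cutoff."""
    # Assumption: probability >= cutoff is classified as 1 (undecided).
    predicted = [1 if p >= cutoff else 0 for p in predicted_prob]
    wrong = sum(1 for a, c in zip(actual, predicted) if a != c)
    return wrong / len(actual)


def first_decile_lift(actual, predicted_prob):
    """Rate of 1s in the top 10 percent by predicted probability,
    divided by the rate of 1s over all observations."""
    ranked = sorted(zip(predicted_prob, actual),
                    key=lambda pair: pair[0], reverse=True)
    top_n = max(1, len(ranked) // 10)
    top_rate = sum(a for _, a in ranked[:top_n]) / top_n
    base_rate = sum(actual) / len(actual)  # assumes at least one actual 1
    return top_rate / base_rate


# Made-up toy values for illustration only; these are NOT from BlueOrRed.
toy_actual = [1, 0, 1, 0, 0, 0, 1, 0, 0, 0]
toy_prob = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.6, 0.1, 0.45, 0.05]

train, valid, test = partition(range(10))
print(len(train), len(valid), len(test))
print(overall_error_rate(toy_actual, toy_prob))
print(first_decile_lift(toy_actual, toy_prob))
```

A first decile lift of, say, 2 would mean that the 10 percent of test observations with the highest predicted probabilities contain twice as many undecided voters as a randomly chosen 10 percent of the test data would be expected to contain.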
