Question: Refer to the scenario in Problem 47 regarding the identification of churning cellphone customers. Apply k-nearest neighbors to classify observations as churning or not by

Refer to the scenario in Problem 47 regarding the identification of churning cellphone customers. Apply k-nearest neighbors to classify observations as churning or not by using Churn as the target (or response) variable. Set aside 20% of the data as a test set and use 80% of the data for training and validation.

a. Based on all the input variables, determine the value of k that maximizes the AUC in a validation procedure.

b. Remove DataPlan, RoamMins, OverageFee, and AccountWeeks from the set of input features considered and re-calibrate the value of k that maximizes the AUC.
How does this k-nearest neighbors model compare to the model obtained in part (a)?

c. For the best-performing k-nearest neighbors model in the validation procedure (with respect to AUC), compute and interpret the lift on the top 10% of test set observations most likely to churn.

 Problem 47

Telecommunications companies providing cell-phone service are interested in customer retention. In particular, identifying customers who are about to churn (cancel their service) is potentially worth millions of dollars if the company can proactively address the reason that customer is considering cancellation and retain the customer. Data on past customers, some of whom churned and some who did not, have been collected. The variables in this data set are listed in the following table.Variable AccountWeeks Description number of weeks customer has had active account ContractRenewal

Apply logistic regression with lasso regularization to classify observations as churning or not by using Churn as the target (or response) variable. Set aside 20% of the data as a test set and use 80% of the data for training and validation.

Variable AccountWeeks Description number of weeks customer has had active account ContractRenewal 1 if customer recently renewed contract, O if not DataPlan Data Usage CustServCalls DayMins DayCalls MonthlyCharge OverageFee Roam Mins Churn 1 if customer has data plan, 0 if not gigabytes of monthly data usage number of calls into customer service average daytime minutes per month average number of daytime calls average monthly bill largest overage fee in last 12 months average number of roaming minutes "Yes" if customer cancelled service, "No" if not

Step by Step Solution

3.41 Rating (170 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Business Analytics Data Questions!