# Question

Banks monitor the use of credit cards to see whether promotions that encourage customers to charge more have been successful. Banks also monitor balances to seek out fraud or cases of stolen credit card numbers. For these methods to work, we need to be able to anticipate the balance on a credit card next month from things that we know today. A key in-dicator of the balance next month is the balance this month. Those balances that roll over earn the bank high rates of interest, often in the neighborhood of 12% to 24% annually.

The data table for this analysis shows a sample of balances for 923 customers of a national issuer of credit cards, such as Visa and Mastercard. The four columns in the data table are balances over four consecutive months. For this analysis, we’ll focus on the relationship between the balance in the third and fourth months.

Motivation

(a) The bank would like to predict the balance of a customer in month 4 from the balance in month 3 to within 10% of the actual value. How well must a linear equation describe the data in order to meet this goal?

(b) Explain in management terms how an equation that anticipates the balance next month based on the current balance could be useful in evaluating the success of a marketing program intended to increase customer account balances.

Method

(c) Form the appropriate scatterplot of the balances in months 3 and 4. Does a linear model seem like a decent way to describe the association?

(d) The scatterplot reveals two types of outliers that deviate from the general pattern. What is the explanation for these outliers?

Mechanics

(e) Fit the linear equation using all of the cases. Briefly summarize the estimated slope and intercept as well as the overall summary statistics, R2 and se.

(f) Exclude the cases that have a near-zero balance in either month 3 or month 4 (or both). Interpret “near zero” to mean an account with balance $25 or less. Refit the equation and compare the results to the equation obtained in part (e). Do the results change in a meaningful way?

(g) Inspect the residuals from the equation ft in part (f). Do these suggest simple variation?

Message

(h) Summarize the ft of your equation for management. Explain in clear terms any relevant symbols. (That is, don’t just give a value for r2 and expect that the reader knows regression analysis.)

(i) Is the goal of predicting the balance next month within 10% possible? Indicate, for management, why or why not.

The data table for this analysis shows a sample of balances for 923 customers of a national issuer of credit cards, such as Visa and Mastercard. The four columns in the data table are balances over four consecutive months. For this analysis, we’ll focus on the relationship between the balance in the third and fourth months.

Motivation

(a) The bank would like to predict the balance of a customer in month 4 from the balance in month 3 to within 10% of the actual value. How well must a linear equation describe the data in order to meet this goal?

(b) Explain in management terms how an equation that anticipates the balance next month based on the current balance could be useful in evaluating the success of a marketing program intended to increase customer account balances.

Method

(c) Form the appropriate scatterplot of the balances in months 3 and 4. Does a linear model seem like a decent way to describe the association?

(d) The scatterplot reveals two types of outliers that deviate from the general pattern. What is the explanation for these outliers?

Mechanics

(e) Fit the linear equation using all of the cases. Briefly summarize the estimated slope and intercept as well as the overall summary statistics, R2 and se.

(f) Exclude the cases that have a near-zero balance in either month 3 or month 4 (or both). Interpret “near zero” to mean an account with balance $25 or less. Refit the equation and compare the results to the equation obtained in part (e). Do the results change in a meaningful way?

(g) Inspect the residuals from the equation ft in part (f). Do these suggest simple variation?

Message

(h) Summarize the ft of your equation for management. Explain in clear terms any relevant symbols. (That is, don’t just give a value for r2 and expect that the reader knows regression analysis.)

(i) Is the goal of predicting the balance next month within 10% possible? Indicate, for management, why or why not.

## Answer to relevant Questions

1. Reciprocal transformation 2. Opposite of log transformation 3. Slope estimates elasticity 4. 1% increase in x 5. Approaches b0 as x gets large 6. Random variation 7. Diminishing marginal return 8. Pattern unsuited to log ...These data contain quarterly net income (in millions of dollars) of Target from 1990 through the end of 2011 (88 quarters). Net income is the difference between sales and the cost of running the business including both the ...This exercise uses base 10 logs instead of natural logs. (a) Using the pet food data of the text example, trans-form the price and volume data using natural logs and then using base 10 logs. Then plot the natural log of ...This figure shows an estimated linear equation with 95% prediction intervals. Assume that the units of X and Y are dollars. (a) Visually estimate the intercept b0 and the slope b1. (b) Is se (b1) less than 1, about 1, or ...It has become common to find a convenience store that also sells gas. These data describe the sales over time at a franchise outlet of a major U.S. oil company. Each row summarizes sales for one day. This particular station ...Post your question

0