Question: 1 . ( 6 points ) This is a real dataset from a credit union ( data here Download here ) , in which the

1.(6 points) This is a real dataset from a credit union (data here Download here), in which the charge-off information for 30,794 customers are kept together with three variables, total monthly income, credit score, current years employed. Charge-off (or bad accounts) is defined as those customers who defaulted on (failed to pay for) their loan from a bank. Good accounts are those customers who are current in their payment. Apply the same intuitive approach introduced in the cancer prediction dataset on this larger dataset to answer the same questions:
How can we tell if any of these three variables can predict the risk of charge-off?
If so, can we tell which one is most predictive?
If so, how can we produce an estimate of risk of charge-off based on these variables?
Use the number of bins =20(22 bins including the overflow and underflow bins) for Intuition 1, and make sure you choose range properly to best show the distributions. Use number of groups =40 for Intuition 2.
2.(4 points) Use basic excel function, if(), or ifs() or vlookup function, pivot table to answer the questions in the first tab of this excel file (here Download here). The data is in the second tab of the same file.
Present the results of your analysis for problem 1 and 2 in one single word document with the sufficient amount of screenshots from your excel file.
Specifically, for problem 1, provide a brief narrative of steps you are taking and observations you make along the way towards implementing the two intuition in this credit union dataset.
I am at a complete loss of how to do this, if anyone can help!

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!