Question: The Credit Card Fraud data is a small version (comprised of 12,240 records) of a much larger data set (containing 248,807 records); it is made

The Credit Card Fraud data is a small version (comprised of 12,240 records) of a much larger data set (containing 248,807 records); it is made up of 2013 European transactions. It is a very unbalanced data set in which there are only a few fraudulent transactions. Attempting to classify transactions as fraudulent will be difficult since there are very few instances of fraud.

Use Logit and a kNN model to create a predictive model for the Credit Card Fraud data. Does either of these models have predictive power? 

Explain carefully the information provided by the lift chart or the decile-wise lift chart; how does this information differ from the information provided by the overall misclassification rate? 

What value to a firm could you see in creating such a model and using it in real time?

Step by Step Solution

3.42 Rating (149 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Forecasting Predictive Analytics Questions!