Question: Use RStudio to solve the following problems. Help with part B please!! 6. Suppose we are running a fraud classification model, with a training set

Use RStudio to solve the following problems. Help with part B please!!

6. Suppose we are running a fraud classification model, with a training set of 10,000 records of which only 400 are fraudulent.

a) How many fraudulent records need to be resampled if we would like the proportion of fraudulent records in the balanced data set to be 20%?

2,000 need to be resampled

b) How many non-fraudulent records need to be set aside if we would like the proportion of fraudulent records in the balanced data set to be 20%?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!