Question: Suppose that we have an imbalanced dataset in a binary classification problem, with many more datapoints with a true label of class 1 than with
Suppose that we have an imbalanced dataset in a binary classification problem, with many more datapoints with a true label of class than with a true label of class eg class is the "majority" class, and class is the "minority" class We would like to use a costsensitive method to train our model, which penalizes the model times as much for predicting a datapoint is in class when it is truly in class than for predicting a datapoint is in class when it is truly in class eg following the notation from the lecture videos, cc The model is of course not penalized for making correct predictions.
In this case, suppose our trained model predicts that a particular test datapoint has probability p of being in class TrueFalse: The model assigns this test datapoint to class based on the optimal threshold that corresponds to the costsensitive method.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
