Question: Data Mining Question Consider a test data of 1000 samples with two classes: + class (100 samples) and class (900 samples). We have two random

Data Mining Question

Consider a test data of 1000 samples with two classes: + class (100 samples) and class (900 samples). We have two random classifiers C1 and C2. Classifier C1 classifies test data to + class randomly with a probability p and classifier C2 classifies test data to + class randomly with a probability 2p.

a) What is the expected TPR and FPR for C1 and C2?

b) Is C2 a better classifier than C1? Hint: The random guess line in an ROC curve corresponds to TPR = FPR.

c) Expected precision for both C1 and C2 is 1/10. Expected recall for C2 is twice than that of C1 (2p and p respectively). If we use precision and recall as the evaluation metrics, C2 appears to be a better classifier than C1. Which evaluation metric pair between {TPR and FPR} and {precision and recall} do you think is correctly indicating the relative performance of C2 and C1?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Q 2 . Classification Evaluation Measures ( Total 6 pt ) Consider a test data of 1 0 0 0 samples with two classes: + class ( 1 0 0 samples ) and class ( 9 0 0 samples ) . We have two random...

Task 1: Distance Map Requires: knowing how to design and implement a class In file distance_map.py, use the Class Design Recipe to define a class called DistanceMap that lets client code store and...

Fundamentals of Analytics and Business Intelligence Maximum Score: 25 Points Please answer all questions in the space provided and submit your solutions on Blackboard by 11:59 pm on October 10. Late...

I have already completed part one shown below. I need help with part 2: MonthlyCostComparator.java, MonthlyCostComparatorTest.java, and CloudStoragePart2.java, CloudStoragePart2Test.java Thank you,...

Consider the symmetric simple random walk at two timepoints: n and n + m. Find ?(Sn, Sn+m). What happens as m ? ? for fixed n and as n ? ? for fixed m? Explain intuitively. 23 Consider the simple...

Possible Multiple Choice Questions for the Exam. Focus on the topics discussed in class. Chapter 1 Multiple Choice Identify the choice that best completes the statement or answers the question. ____...

Exercises Chapter 2 2.1 Marginal and conditional probability: The social mobility data from Section 2.5 gives a joint probability distribution on (Y1 , Y2 )= (father's occupation, son's occupation)....

Follow the steps given in Machine Learning With R , Chapter 5, section "Example Identifying Risky Bank Loans Using C5.0 Decision Trees." download the credit. csv file from Packt Publishing's website...

Inference for Relationships Checkpoint Question 1 Select one answer. 10 points A psychologist wants to study whether there is a difference in the effectiveness of two different talk therapy...

Midterm Examination Questions - Spring 2016 1. The final examination grades of random samples of students from three different classes are shown below. Class A Class B Class C 92 91 85 85 85 93 96 90...

Determining flexible budget variances Use the standard price and cost data provided in Exercise 8-3A. Assume that the actual sales price is $7.65 per unit and that the actual variable cost is $4.25...

Nickel carbonyl, Ni (CO)4, is one of the most toxic substances known. The present maximum allowable concentration in laboratory air during an 8-hr workday is 1 ppb (parts per billion) by volume,...

An analyst has determined the NPV of a new product launch to be $ 1 0 . 0 million. In the NPV analysis, the analyst used the _ _ _ _ _ _ _ _ _ _ as the discount rate. In doing so , the analyst...

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

=+2 What problems do you see for MNEs like Ford Motor Company that must bargain with unions in multiple countries?

=+j on to staff their operations in the global marketplace.

=+3 What do you predict for the future of unions and union relations in the global economy? Why?