? ? You need to understand and be able to calculate the following three terms to do
Fantastic news! We've Found the answer you've been seeking!
Question:
?
?
Transcribed Image Text:
You need to understand and be able to calculate the following three terms to do this activity. Precision is the fraction of positively predicted outcomes that are actually positive i.e., if the output is predicted to be positive, what is the chance that it is actually positive? . Recall is the fraction of all actual positive data points in our sample that are predicted as positive i.e., out of all the actual positive outcomes, how many have been predicted as positive? Overall accuracy is the fraction of all data points that have been predicted correctly i.e., out of all the data points how many positives have been predicted as positive and how many negatives have been predicted as negative? Note: You can leave all the answers in fractions but putting them in decimals or percentages would make them more interpretable. Before attempting the questions, you need to understand what is across the rows and what is across the columns. You also need to understand what each of the numbers in each of the cells means. Put them in plain English. For example, what is the number "2", what is the number "15"? (Q1) We have built a new spam filter and want to evaluate how good it is. Given below is the confusion matrix for the spam filter for 100 e-mails. Spam Not Spam (a) What is the number 15 here? Put it in plain English Predicted Actual Spam 15 10 Not Spam 5 70 (b) Calculate the Precision for the Spam Filter. What is the interpretation of having this value for precision i.e., How would you explain this to someone who doesn't know how precision is calculated but still uses e-mail and gets spam e-mails? (c) Calculate the recall for the Spam Filter. What is the interpretation of having this value for recall i.e., How would you explain this to someone who doesn't know how recall is calculated but still uses e-mail and gets spam mails? (d) You can see here that the precision is very good for this spam filter but the recall is not so good. What does it mean to have high precision and low recall (Hint: Think about how you interpret precision and recall and apply it to the context of spam filter). What might the possible reason you are seeing these results? (e) What does it mean to have high recall and low precision for a spam filter? Which of the two do you think is better i.e., high precision and low recall or high recall and low precision. (f) What is the overall accuracy of the spam filter? What do you mean when you say this spam filter has this value of accuracy? (Q2) You have the confusion matrix for the performance of a classifier I used to predict a student's grade in the class. These grades are based on the overall scores of the students across different assessments including Quizzes, Home Works, Attendance, in-class activities etc., The range for each of the grades in given below ● A 90-100 ● B 80-89 ● C 70-79 • D below 70 Actual ABCO D A 142L 10 Predicted В B281L C 3 69 3 DUMNE 4 3 2 11 (a) There are two "4" s in this matrix. Provide a plain English description for each of them. (b) How many students do I have in the class? (c) In plain English, define what is "Precision for Grade B" in this context. What is the Precision for Grade C and for Grade D? (d) In plain English, define what is "Recall for Grade C" in this context. What is the Recall for Grade A and for Grade B? What does it mean to have a higher recall for one grade and not the other? (e) What is the overall accuracy? You need to understand and be able to calculate the following three terms to do this activity. Precision is the fraction of positively predicted outcomes that are actually positive i.e., if the output is predicted to be positive, what is the chance that it is actually positive? . Recall is the fraction of all actual positive data points in our sample that are predicted as positive i.e., out of all the actual positive outcomes, how many have been predicted as positive? Overall accuracy is the fraction of all data points that have been predicted correctly i.e., out of all the data points how many positives have been predicted as positive and how many negatives have been predicted as negative? Note: You can leave all the answers in fractions but putting them in decimals or percentages would make them more interpretable. Before attempting the questions, you need to understand what is across the rows and what is across the columns. You also need to understand what each of the numbers in each of the cells means. Put them in plain English. For example, what is the number "2", what is the number "15"? (Q1) We have built a new spam filter and want to evaluate how good it is. Given below is the confusion matrix for the spam filter for 100 e-mails. Spam Not Spam (a) What is the number 15 here? Put it in plain English Predicted Actual Spam 15 10 Not Spam 5 70 (b) Calculate the Precision for the Spam Filter. What is the interpretation of having this value for precision i.e., How would you explain this to someone who doesn't know how precision is calculated but still uses e-mail and gets spam e-mails? (c) Calculate the recall for the Spam Filter. What is the interpretation of having this value for recall i.e., How would you explain this to someone who doesn't know how recall is calculated but still uses e-mail and gets spam mails? (d) You can see here that the precision is very good for this spam filter but the recall is not so good. What does it mean to have high precision and low recall (Hint: Think about how you interpret precision and recall and apply it to the context of spam filter). What might the possible reason you are seeing these results? (e) What does it mean to have high recall and low precision for a spam filter? Which of the two do you think is better i.e., high precision and low recall or high recall and low precision. (f) What is the overall accuracy of the spam filter? What do you mean when you say this spam filter has this value of accuracy? (Q2) You have the confusion matrix for the performance of a classifier I used to predict a student's grade in the class. These grades are based on the overall scores of the students across different assessments including Quizzes, Home Works, Attendance, in-class activities etc., The range for each of the grades in given below ● A 90-100 ● B 80-89 ● C 70-79 • D below 70 Actual ABCO D A 142L 10 Predicted В B281L C 3 69 3 DUMNE 4 3 2 11 (a) There are two "4" s in this matrix. Provide a plain English description for each of them. (b) How many students do I have in the class? (c) In plain English, define what is "Precision for Grade B" in this context. What is the Precision for Grade C and for Grade D? (d) In plain English, define what is "Recall for Grade C" in this context. What is the Recall for Grade A and for Grade B? What does it mean to have a higher recall for one grade and not the other? (e) What is the overall accuracy?
Expert Answer:
Answer rating: 100% (QA)
a 15 First let us see the below confusion matrix from a binary classification mode according to th... View the full answer
Related Book For
Posted Date:
Students also viewed these accounting questions
-
Is it important for business managers to understand and be involved in IT governance? Why or why not?
-
What is one interpretation of a high P/E ratio?
-
What is the interpretation of the direct-material price variance?
-
Jaclyn Hargrove is the owner of six Pickwick Restaurants. For the past 10 years, she has always relied on her accountant to analyze her financial statements. Jaclyn feels that if she were able to...
-
Stanford-Binet IQ Test scores are normally distributed with a mean score of 1(H) and a standard deviation of 16. a. Sketch the distribution of Stanford-Binet IQ test scores. b. Write the equation...
-
Define the following: shooting rights G&G costs carrying costs dry-hole contribution bottom-hole contribution
-
The velocity ratio of third systemof pulleys is _____.
-
These are selected 2014 transactions for Amarista Corporation: Jan. 1 Purchased a copyright for $120,000. The copyright has a useful life of 6 years and a remaining legal life of 30 years. Mar. 1...
-
Read the Testing the Nervous System case study . Answer each question and write a report for the case study. Which of the test results indicated a brain injury and why? Which of the test results...
-
A major pharmaceutical wholesaler buys brand drugs from a manufacturer at wholesale prices and sells them to pharmacies at retail prices. It estimates that the wholesale (W) price, the retail (R)...
-
The board of directors of JinFeng Inc .( JF ), which adopts cleaner production technology, is discussing an incentive plan drafted by its CEO, Michael Roberts. According to the plan, a 1% of the...
-
Calculate the Net Present Value of an investment with the following cash flow information: Cost of Capital 2%, Cash Outflow Year 0 = $150,000 Cash Inflows Year 1 = $25,000 Year 2 = $30,000 Year 3 =...
-
Why do shareholders of a company with poor quality of corporate governance demand high payout, whenever possible?
-
Question- Toyota Corporation paid $3 dividend last year. The dividend is expected to grow at a rate of 2% for the coming 3 years and 2% thereafter and forever. The required rate of return is 9%. [6...
-
A pharmaceutical company has R1 million allocated for the following capital projects: Project Investment (R'000) NPV (R'000) 1 300 66 2 200 -4 3 250 43 4 100 14 5 100 7 6 350 63 7 400 48 The...
-
Budgeted manufacturing overhead costs $4,300,000 Budgeted machine-hours 172,000 Actual manufacturing overhead costs $4,140,000 Actual machine-hours 166,000 pop-up content ends. 1. Calculate the...
-
f(2+ h)-f(2) Let f(x)=5x+2x+4 and let g(h): =1 h Determine each of the following: (a) g(1) = (b) g(0.1) = (c) g(0.01) = You will notice that the values that you entered are getting closer and closer...
-
The Smiths buy a house. They borrow 80 percent of the purchase price from the local ABC Savings and Loan. Before they make their first payment, ABC transfers the right to receive mortgage payments to...
-
Your company, which manufactures roller blades and related equipment and clothing, is interested in gaining a better understanding of its customers. While the company has a CRM system that contains...
-
How would you distinguish between an organizational weakness and a threat to the organization? How would you distinguish between a strength and an opportunity?
-
What percentage of revenue should an organization spend on IT? Explain the rationale for your answer.
-
This chapter discusses the use of job redesign to reduce turnover. Do you think this is feasible in this case? Why or why not? If so, how should the job be redesigned?
-
What other solutions could you see be effective at improving employee motivation and reducing the turnover rate? Why do you believe these solutions would be useful?
-
Should the whole team have decided on the team members schedule accommodations collectively? Why or why not?
Study smarter with the SolutionInn App