? ? You need to understand and be able to calculate the following three terms to do
Fantastic news! We've Found the answer you've been seeking!
Question:
?
?
Transcribed Image Text:
You need to understand and be able to calculate the following three terms to do this activity. Precision is the fraction of positively predicted outcomes that are actually positive i.e., if the output is predicted to be positive, what is the chance that it is actually positive? . Recall is the fraction of all actual positive data points in our sample that are predicted as positive i.e., out of all the actual positive outcomes, how many have been predicted as positive? Overall accuracy is the fraction of all data points that have been predicted correctly i.e., out of all the data points how many positives have been predicted as positive and how many negatives have been predicted as negative? Note: You can leave all the answers in fractions but putting them in decimals or percentages would make them more interpretable. Before attempting the questions, you need to understand what is across the rows and what is across the columns. You also need to understand what each of the numbers in each of the cells means. Put them in plain English. For example, what is the number "2", what is the number "15"? (Q1) We have built a new spam filter and want to evaluate how good it is. Given below is the confusion matrix for the spam filter for 100 e-mails. Spam Not Spam (a) What is the number 15 here? Put it in plain English Predicted Actual Spam 15 10 Not Spam 5 70 (b) Calculate the Precision for the Spam Filter. What is the interpretation of having this value for precision i.e., How would you explain this to someone who doesn't know how precision is calculated but still uses e-mail and gets spam e-mails? (c) Calculate the recall for the Spam Filter. What is the interpretation of having this value for recall i.e., How would you explain this to someone who doesn't know how recall is calculated but still uses e-mail and gets spam mails? (d) You can see here that the precision is very good for this spam filter but the recall is not so good. What does it mean to have high precision and low recall (Hint: Think about how you interpret precision and recall and apply it to the context of spam filter). What might the possible reason you are seeing these results? (e) What does it mean to have high recall and low precision for a spam filter? Which of the two do you think is better i.e., high precision and low recall or high recall and low precision. (f) What is the overall accuracy of the spam filter? What do you mean when you say this spam filter has this value of accuracy? (Q2) You have the confusion matrix for the performance of a classifier I used to predict a student's grade in the class. These grades are based on the overall scores of the students across different assessments including Quizzes, Home Works, Attendance, in-class activities etc., The range for each of the grades in given below ● A 90-100 ● B 80-89 ● C 70-79 • D below 70 Actual ABCO D A 142L 10 Predicted В B281L C 3 69 3 DUMNE 4 3 2 11 (a) There are two "4" s in this matrix. Provide a plain English description for each of them. (b) How many students do I have in the class? (c) In plain English, define what is "Precision for Grade B" in this context. What is the Precision for Grade C and for Grade D? (d) In plain English, define what is "Recall for Grade C" in this context. What is the Recall for Grade A and for Grade B? What does it mean to have a higher recall for one grade and not the other? (e) What is the overall accuracy? You need to understand and be able to calculate the following three terms to do this activity. Precision is the fraction of positively predicted outcomes that are actually positive i.e., if the output is predicted to be positive, what is the chance that it is actually positive? . Recall is the fraction of all actual positive data points in our sample that are predicted as positive i.e., out of all the actual positive outcomes, how many have been predicted as positive? Overall accuracy is the fraction of all data points that have been predicted correctly i.e., out of all the data points how many positives have been predicted as positive and how many negatives have been predicted as negative? Note: You can leave all the answers in fractions but putting them in decimals or percentages would make them more interpretable. Before attempting the questions, you need to understand what is across the rows and what is across the columns. You also need to understand what each of the numbers in each of the cells means. Put them in plain English. For example, what is the number "2", what is the number "15"? (Q1) We have built a new spam filter and want to evaluate how good it is. Given below is the confusion matrix for the spam filter for 100 e-mails. Spam Not Spam (a) What is the number 15 here? Put it in plain English Predicted Actual Spam 15 10 Not Spam 5 70 (b) Calculate the Precision for the Spam Filter. What is the interpretation of having this value for precision i.e., How would you explain this to someone who doesn't know how precision is calculated but still uses e-mail and gets spam e-mails? (c) Calculate the recall for the Spam Filter. What is the interpretation of having this value for recall i.e., How would you explain this to someone who doesn't know how recall is calculated but still uses e-mail and gets spam mails? (d) You can see here that the precision is very good for this spam filter but the recall is not so good. What does it mean to have high precision and low recall (Hint: Think about how you interpret precision and recall and apply it to the context of spam filter). What might the possible reason you are seeing these results? (e) What does it mean to have high recall and low precision for a spam filter? Which of the two do you think is better i.e., high precision and low recall or high recall and low precision. (f) What is the overall accuracy of the spam filter? What do you mean when you say this spam filter has this value of accuracy? (Q2) You have the confusion matrix for the performance of a classifier I used to predict a student's grade in the class. These grades are based on the overall scores of the students across different assessments including Quizzes, Home Works, Attendance, in-class activities etc., The range for each of the grades in given below ● A 90-100 ● B 80-89 ● C 70-79 • D below 70 Actual ABCO D A 142L 10 Predicted В B281L C 3 69 3 DUMNE 4 3 2 11 (a) There are two "4" s in this matrix. Provide a plain English description for each of them. (b) How many students do I have in the class? (c) In plain English, define what is "Precision for Grade B" in this context. What is the Precision for Grade C and for Grade D? (d) In plain English, define what is "Recall for Grade C" in this context. What is the Recall for Grade A and for Grade B? What does it mean to have a higher recall for one grade and not the other? (e) What is the overall accuracy?
Expert Answer:
Answer rating: 100% (QA)
a 15 First let us see the below confusion matrix from a binary classification mode according to th... View the full answer
Related Book For
Posted Date:
Students also viewed these accounting questions
-
Is it important for business managers to understand and be involved in IT governance? Why or why not?
-
What is one interpretation of a high P/E ratio?
-
What is the interpretation of the direct-material price variance?
-
Jaclyn Hargrove is the owner of six Pickwick Restaurants. For the past 10 years, she has always relied on her accountant to analyze her financial statements. Jaclyn feels that if she were able to...
-
Suppose that a,b R satisfy b/a RZ. Find all q > 0 such that converges. ak +bq k=1
-
A thin, homogeneous wire is bent to form the perimeter of the figure indicated. Locate the center of gravity of the wire figure thus formed. r= 150 mm r= 75 mm
-
Consider a fictitious dataset of \(n=100\) observations with \(s_{y}=80\). We run a regression with three explanatory variables to get \(s=50\). a. Calculate the adjusted coefficient of...
-
Carlton, Weber, and Stansbury share profits equally and have capital balances of $120,000, $70,000, and $80,000, respectively, as of December 31, 2014. Effective January 1, 2015, Stansbury has...
-
Let's suppose you had two potential segments to consider for targeting. Segment A is of moderate market attractiveness and your firm has an advantageous competitive position. Segment B has a high...
-
Halifax Manufacturing allows its customers to return merchandise for any reason and receive a credit to their accounts. All of Halifax's sales are for credit (no cash is collected at the time of...
-
When setting prices for different groups of customers, a manager should charge lower prices to groups that have a more elastic demand. have a more inelastic demand. O have a higher demand. value the...
-
Mileage (Miles per gallon) 45 15 Horse power 100 400 B) Relation between Maximum Speed and Horsepower is shown in the following graph. Maximum Speed (Miles) 120 90 90 Horse power 100 400
-
The Wildhorse, Inc. sold 9,120 season tickets at $2,100 each. By December 31, 2025, 16 of the 40 home games had been played. What amount should be reported as a current liability at December 31,...
-
Kenneth's small business can produce up to 1,100 units. He looks at his profit information from the past year and notes the following: sales of $7,200, variable cost per unit of $4, fixed costs of...
-
1.Time Line-Show the time line for $450 cash outflow today, a $539.55 Cash outflow in year two and a 10% interest rate. 2.One Year Future Value- What is the future value of $600 deposited for one...
-
Why is the concept financial analysis and decision making the most important concept covered from the list below: 1 . The Accounting Cycle ( including issues in accounting ) ; 2 . Accounting for...
-
Jerry has been asked to creat a spread sheet containing details about the documents and data included in the LHR define for disclosure. for example what system contains this information. this would...
-
The Smiths buy a house. They borrow 80 percent of the purchase price from the local ABC Savings and Loan. Before they make their first payment, ABC transfers the right to receive mortgage payments to...
-
Your company, which manufactures roller blades and related equipment and clothing, is interested in gaining a better understanding of its customers. While the company has a CRM system that contains...
-
How would you distinguish between an organizational weakness and a threat to the organization? How would you distinguish between a strength and an opportunity?
-
What percentage of revenue should an organization spend on IT? Explain the rationale for your answer.
-
Based on the photographs in Figure 26.13, in which segment(s) is the Antp gene normally expressed? Figure 26.13: (a) Normal fly (b) Antennapedia mutant
-
The bush baby, a small African mammal, is a remarkable jumper. Although only about 8 inches long, it can jump, from a standing start, straight up to a height of over 7 feet! Use the particle model to...
-
Your friend Travis claims to have set the new world speed record for riding a unicycle. His top speed, he says, was 55 m/s. Do you believe him? Explain.
Study smarter with the SolutionInn App