A university is applying classification methods in order to identify alumni who may be interested in...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
A university is applying classification methods in order to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100 classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set: Actual Donation No Donation 23 23,384 No Donation The following table lists some information on individual observations from the validation set: Predicted Donation 265 5,430 Observation ID Probability of Donation 0.4 0.8 No Donation. Donation Donation 0.6 (a) Choose the correct explanation for how the probability of Donation was computed for the three observations. (i) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "Donation" and those that classified it as "No Donation." (ii) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "No Donation and those that classified it as "Donation." (iii) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "Donation." (iv) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "No Donation." Option () v Why were Observations B and C classified as Donation and Observation A was classified as No Donation? Actual Class No Donation Donation No Donation If required, round your answers to one decimal place The probability of Donation for Observation A is The probability of Donation for Observation B is The probability of Donation for Observation C is If required, round your answer to three decimal places. Accuracy Predicted Class It is i It is It is 1 (b) Compute the values of accuracy, sensitivity, specificity, and precision. Explain why accuracy is a misleading measure to consider in this case. Evaluate the performance of the random forest, particularly commenting on the precision measure. If required, round your answers to the nearest whole percentage. Accuracy is not the best measure to use for unbalanced data sets because less than The value of precision seems disturbingly ✓than 0.5, so Observation A is classified as No Donation by the random forest. ✓than 0.5, so Observation 8 is classified as Donation by the random forest. ✓than 0.5, so Observation C is classified as Donation by the random forest. % of the alumni in the data have donated. If required, round your answers for Sensitivity and Specificity to three decimal places and round your answer for Precision to four decimal places. Sensitivity Specificity Precision The precision measure represents the percentage of alumni classified by the random forest as Va tremendous improvement in the ability to target alumni who may be more likely to donate. ✓that are donors. Comparing the value of precision with the proportion of observations corresponding to donations, there A university is applying classification methods in order to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100 classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set: Actual Donation No Donation 23 23,384 No Donation The following table lists some information on individual observations from the validation set: Predicted Donation 265 5,430 Observation ID Probability of Donation 0.4 0.8 No Donation. Donation Donation 0.6 (a) Choose the correct explanation for how the probability of Donation was computed for the three observations. (i) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "Donation" and those that classified it as "No Donation." (ii) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "No Donation and those that classified it as "Donation." (iii) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "Donation." (iv) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "No Donation." Option () v Why were Observations B and C classified as Donation and Observation A was classified as No Donation? Actual Class No Donation Donation No Donation If required, round your answers to one decimal place The probability of Donation for Observation A is The probability of Donation for Observation B is The probability of Donation for Observation C is If required, round your answer to three decimal places. Accuracy Predicted Class It is i It is It is 1 (b) Compute the values of accuracy, sensitivity, specificity, and precision. Explain why accuracy is a misleading measure to consider in this case. Evaluate the performance of the random forest, particularly commenting on the precision measure. If required, round your answers to the nearest whole percentage. Accuracy is not the best measure to use for unbalanced data sets because less than The value of precision seems disturbingly ✓than 0.5, so Observation A is classified as No Donation by the random forest. ✓than 0.5, so Observation 8 is classified as Donation by the random forest. ✓than 0.5, so Observation C is classified as Donation by the random forest. % of the alumni in the data have donated. If required, round your answers for Sensitivity and Specificity to three decimal places and round your answer for Precision to four decimal places. Sensitivity Specificity Precision The precision measure represents the percentage of alumni classified by the random forest as Va tremendous improvement in the ability to target alumni who may be more likely to donate. ✓that are donors. Comparing the value of precision with the proportion of observations corresponding to donations, there
Expert Answer:
Answer rating: 100% (QA)
ANSWER a The correct explanation for how the probability of Donation was computed for the three observations is 3 The probability of Donation for each ... View the full answer
Related Book For
Business Analytics Communicating With Numbers
ISBN: 9781260785005
1st Edition
Authors: Sanjiv Jaggia, Alison Kelly, Kevin Lertwachara, Leida Chen
Posted Date:
Students also viewed these accounting questions
-
The State of Connecticut unfortunately has no major professional sports team. Hartford because of location, population, highways, an airport, and prior sports history is a suitable city to host a...
-
Table 118 identifies twenty-six steps in project planning and control. Below is a description of each of the twenty-six steps. Using this information, fill in columns 1 and 2 (column 2 is a group...
-
Read the following article and answer the questions below: Building Competitive Advantage Through People Magazine: Winter 2002Research Feature January 15, 2002 Reading Time: 23 min Christopher A....
-
What is your debt payments-to-income ratio if your debt payments total $684 and your net income is $2,000 per month?
-
Many MBAs who ventured into the ??dot-com?? world of the late 1990s found themselves unemployed by 2001 as many firms in that industry ceased to exist. However, during their tenure with these...
-
Oregon Outfitters, a manufacturer of fly fishing flies, uses a normal-costing system with a single overhead cost pool and direct labor cost as the cost-allocation base. The following data are for...
-
The disadvantage(s) of the Wilson equation over the van Laar equation is/are that (a) It can not be applied for the systems in which \(\gamma\) shows maxima or minima (b) It is not suitable for the...
-
Cherry Jones owns a homeopathic medicine company called Faithhealers. She sells vitamins and other relatively nonperishable products for those who want choices regarding alternative medicine. Cherry...
-
Access and listen to the podcast below from NPR's Planet Money regarding credit reports. Episode 798: Bad Credit BureauLinks to an external site Reference:...
-
Russian is an Indo-European language of the Slavic family, spoken in Russia. Determine from the following Russian data whether the low front [a] and the low back [a] complement each other as...
-
2. Which property of the ASCII character code allows lower-case letters to easily be converted to upper-case letters (a A etc) in ASCII representation? 3. Explain, with an 8-bit example, how you can...
-
You started a company that manufactures belts and sells them for $200 each. If you need to sell 2000 belts per month to break even and fixed costs per month are $30000, what is the variable cost per...
-
Companies A and B have been offered the following rates per annum on a $20 million 5-year loan, and a bank, acting as intermediary, will charge 0.10% per annum (10 basis points) to arrange and manage...
-
Izzy has been married to her wife(she/her/hers), white woman who also identifies as part of the LGBTIA+ community, for 2 years and they are expecting their first child. Neither Izzy nor her wife...
-
Explain what an arbitrageur would do in the following circumstances. $/SF exchange rate is $.51/SF, the Swiss risk-free rate is 4% per year, the US risk free rate is 6% per year, and a SF Call option...
-
Frankie bought a trip on her credit card on May 28 for $2,231.00. She paid off the entire amount of the trip including the interest charges 128 days later. The annual interest rate on purchases is...
-
Nanette works for Piroz and is paid a basic wage of $1,000 a week. Piroz operates the following bonus scheme: (1) Each employee gets a bonus of $4 for every unit they produce in excess of 2,000 units...
-
What is taxable income, and what is the formula for determining taxable income?
-
Perform k-means clustering on the accompanying data set. Use variables x 1 , x 3 , and x 5 in the analysis. Do not standardize the variables. a. Set the number of clusters to 3. What are the size and...
-
It is generally believed that the wealthier a students family, the higher the students Scholastic Aptitude Test (SAT) score. Another commonly used predictor for SAT scores is the students grade point...
-
Perform k-means clustering on the accompanying data set. Use variables x 4 , x 5 , x 6 , and x 7 , standardized to z-scores, in the analysis. a. Specify the k value as 2 and plot the cluster...
-
A construction contract differs from contracts that we generally deal with that focus on an easily defined physical object because the physical object can be examined. How is the object of a...
-
What does the owner contribute to the project and what does the contractor contribute to the project?
-
For what type of project is a line-of-balance schedule particularly suited? Identify specific examples.
Study smarter with the SolutionInn App