Consider an ensemble learning algorithm that uses simple majority voting among M learned hypotheses (you may...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Consider an ensemble learning algorithm that uses simple majority voting among M learned hypotheses (you may assume M is odd). Suppose that each hypothesis has error & where 0.5 > >0 and that the errors made by each hypothesis are independent of the others'. Show your work. a. (5 pts) Calculate a formula for the error of the ensemble algorithm in terms of M and E. The ensemble makes an error just in case (M+1)/2 or more hypotheses make an error simultaneously. Recall that the probability that exactly k hypotheses make an error is P(exactly k hypotheses make an error) = (M) *(1-ɛ) (M-k) where (), read "M choose k," is the number of distinct ways of choosing k distinct objects from a set of M distinct objects, calculated as (1) where x!, read "x factorial," is x!= 1*2*3*...*x. Then, M! k!(M-k)! P(error) = E=(M+1)/2 P (exactly k hypotheses make an error) = E=(M+1)/2() Ek(1-E) (M-k) b. (5 pts) Evaluate it for the cases where M = 5, 11, and 21 and c = 0.1, 0.2, and 0.4. M=5 M=11 M=21 0.00856 2.98e-4 1.35e-6 0.0579 0.0117 9.70e-4 0.317 0.247 0.174 -0.1 € 0.2 -0.4 c. (5 pts) If the independence assumption is removed, is it possible for the ensemble error to be worse than &? Produce either an example or a proof that it is not possible. YES. Suppose M=3 and ε = 0.4 = 2/5. Suppose the ensemble predicts five examples el...e5 as follows. el: M1 and M2 are in error, so they out-vote M3 and the prediction of el is in error. e2: M1 and M3 are in error, so they out-vote M2 and the prediction of e2 is in error. e3: M2 and M3 are in error, so they out-vote M1 and the prediction of e3 is in error. e4, e5: None of the hypotheses make an error on e4 or e5, so the predictions of e4 and e5 are correct. The result is that each hypothesis has made 2 errors out of 5 predictions, for an error on each hypothesis of 2/5 = 0.4 = &, as stated. However, the ensemble has made 3 errors out of 5 predictions, for an error on the ensemble of 3/5=0.6>=0.4. Consider an ensemble learning algorithm that uses simple majority voting among M learned hypotheses (you may assume M is odd). Suppose that each hypothesis has error & where 0.5 > >0 and that the errors made by each hypothesis are independent of the others'. Show your work. a. (5 pts) Calculate a formula for the error of the ensemble algorithm in terms of M and E. The ensemble makes an error just in case (M+1)/2 or more hypotheses make an error simultaneously. Recall that the probability that exactly k hypotheses make an error is P(exactly k hypotheses make an error) = (M) *(1-ɛ) (M-k) where (), read "M choose k," is the number of distinct ways of choosing k distinct objects from a set of M distinct objects, calculated as (1) where x!, read "x factorial," is x!= 1*2*3*...*x. Then, M! k!(M-k)! P(error) = E=(M+1)/2 P (exactly k hypotheses make an error) = E=(M+1)/2() Ek(1-E) (M-k) b. (5 pts) Evaluate it for the cases where M = 5, 11, and 21 and c = 0.1, 0.2, and 0.4. M=5 M=11 M=21 0.00856 2.98e-4 1.35e-6 0.0579 0.0117 9.70e-4 0.317 0.247 0.174 -0.1 € 0.2 -0.4 c. (5 pts) If the independence assumption is removed, is it possible for the ensemble error to be worse than &? Produce either an example or a proof that it is not possible. YES. Suppose M=3 and ε = 0.4 = 2/5. Suppose the ensemble predicts five examples el...e5 as follows. el: M1 and M2 are in error, so they out-vote M3 and the prediction of el is in error. e2: M1 and M3 are in error, so they out-vote M2 and the prediction of e2 is in error. e3: M2 and M3 are in error, so they out-vote M1 and the prediction of e3 is in error. e4, e5: None of the hypotheses make an error on e4 or e5, so the predictions of e4 and e5 are correct. The result is that each hypothesis has made 2 errors out of 5 predictions, for an error on each hypothesis of 2/5 = 0.4 = &, as stated. However, the ensemble has made 3 errors out of 5 predictions, for an error on the ensemble of 3/5=0.6>=0.4.
Expert Answer:
Answer rating: 100% (QA)
a The ensemble makes an error just in case M12 or more hypotheses make an error simultaneously The p... View the full answer
Related Book For
Elementary Statistics
ISBN: 978-0538733502
11th edition
Authors: Robert R. Johnson, Patricia J. Kuby
Posted Date:
Students also viewed these computer engineering questions
-
Characterize the errors made by William Blackwell and Madison Wells as either "honest mistakes," negligence, or recklessness. Defend each of your characterizations.
-
Consider the simple regression yt = xt + 1 where E p[ | x] = 0 and E [2 | x ] = 2 (a) What is the minimum mean squared error linear estimator of ? Choose e to minimize Var [] + [E( ?? )]2. The answer...
-
Consider a database with objects X and Y and assume that there are two transactions T1 and T2. Transaction T1 reads objects X and Y and then writes object X. Transaction T 2 reads objects X and Y and...
-
Mrs Anh Thuy is a 43 year old lady admitted following an incidence of blurred vision, numbness down the right side and a sharp pain in her head. A neighbour found her on the ground unable to move or...
-
For each of the following alcohols, give the systematic name and specif) whether the alcohol is primary, sec-ondary, or tertiary. a. b. c. Cl CH:CHCH2CH2 CH2CH2CH, CH3CCH2CH3 CH3 OH
-
During the week of May 10, Hyrum Manufacturing produced and shipped 16,000 units of its aluminum wheels: 4,000 units of Model A and 12,000 units of Model B. The cycle time for Model A is 1.09 hours...
-
Which negotiation strategy has the highest risk (possibility of making the customer angry)? Which strategy do you think has the lowest risk (is most effective with customers)?
-
The ages of 20 dogs in a pet shelter are shown. Construct a frequency distribution using 7 classes. 3 6. 4 4 9. 4 3 4 9.
-
1. Analyze the reasons China has a twin surplus. 2. What are the repercussions of this fact over a long period of time? 3.I just need help wtih the consequences.
-
John Fuji (birthdate June 6, 1981) received the following Form W-2 from his employer related to his job as a manager at a Washington apple-processing plant: Johns other income includes interest on a...
-
West Corp. issued 25-year bonds two years ago at a coupon rate of 5.3 percent. The bonds make semiannual payments. If these bonds currently sell for 105 percent of par value, what is the YTM? DTO,...
-
Comparing the future value of two investments you HAVE $ 4 5 0 to invest every month for the next 5 years. there are two alternative investments you are being offered: Investment A offers an...
-
Assume that the City of Coyote has produced its financial statements for December 3 1 , 2 0 2 4 , and the year then ended. The city s general fund was only used to monitor education and parks. Its...
-
Imagine you are purchasing a late model vehicle for $15,000. You have saved $3,000 cash to purchase the vehicle. You have no trade-in vehicle. Sales tax of 7% will have to paid on the purchase price...
-
For real estate developemnt company explain the different ways to collect the data (market research )? either Outsourced or in-house? and Why?
-
Name the document that the Advertising Standards of Canada association uses to provide the ethical guidelines to both advertisers and advertising agencies.
-
3 ) ) Which one of the following memory organisations is impossible and explain why? 10-bit address, 1024 cells, 8-bit cell size 10-bit address, 1024 cells, 12-bit cell size 9-bit address, 1024...
-
Subtract the polynomials. (-x+x-5) - (x-x + 5)
-
A computer was used to construct this dotplot below. a. How many data values are shown? b. List the values of the five smallest data. c. What is the value of the largest data item? d. What value...
-
Many organizations offer special magazine rates to their members. The American Federation of Teachers is no different, and here are a few of the rates they offer their members. a. Construct a scatter...
-
Ronald Fisher, an English statistician (18901962), collected measurements for a sample of 150 irises. Of concern were five variables: species, petal width (PW), petal length (PL), sepal width (SW),...
-
The preferred stock of the Luxemburg Mining Corporation pays a $3.25 dividend. What is the value of the stock if your required rate of return is 8 percent?
-
What is the value of Brunei Petroleum Companys preferred stock when the dividend rate is 18 percent on a $100 par value? The appropriate discount rate for a stock of this risk level is 14 percent.
-
What is the value of Queens Park PLCs preferred stock when the dividend rate is 10 percent on a $100 par value? The appropriate discount rate for a stock of this risk level is 16 percent.
Study smarter with the SolutionInn App