Suppose that you take a data set and divide it into two parts of equal size....
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Suppose that you take a data set and divide it into two parts of equal size. The first part (Part I) and the second part (Part II). You will try out two different classification procedures, by using Part I and Part II as you training set and test set, respectively. Which means that you will use half of the data for training, and the remaining half for testing. a) First we use 1-Nearest Neighbour rule (1-NN) and get an average error rate (averaged over both test and training data sets) of 8%. What was the error rate with 1-nearest neighbour on the test set? Briefly reason the answer. b) Next we use the Adaboost Algorithm and get an error rate of 10% on the training data. We also get the average error rate (averaged over both test and training data sets) of 12%. What was the error rate with the Adaboost Algorithm on the test set? Just answer the error rate. c) Now, we swap the roles of Part I and Part II, and repeat the same experiments. On the test set (Part I), we get an error rate of 12% with both 1-NN and the Adaboost Algorithm. Based on all these results, by the cross-validation, indicate the method which we should prefer to use for classification of new observations, with a simple tivate W reasoning. Go to Setting Suppose that you take a data set and divide it into two parts of equal size. The first part (Part I) and the second part (Part II). You will try out two different classification procedures, by using Part I and Part II as you training set and test set, respectively. Which means that you will use half of the data for training, and the remaining half for testing. a) First we use 1-Nearest Neighbour rule (1-NN) and get an average error rate (averaged over both test and training data sets) of 8%. What was the error rate with 1-nearest neighbour on the test set? Briefly reason the answer. b) Next we use the Adaboost Algorithm and get an error rate of 10% on the training data. We also get the average error rate (averaged over both test and training data sets) of 12%. What was the error rate with the Adaboost Algorithm on the test set? Just answer the error rate. c) Now, we swap the roles of Part I and Part II, and repeat the same experiments. On the test set (Part I), we get an error rate of 12% with both 1-NN and the Adaboost Algorithm. Based on all these results, by the cross-validation, indicate the method which we should prefer to use for classification of new observations, with a simple tivate W reasoning. Go to Setting
Expert Answer:
Answer rating: 100% (QA)
a If the average error rate for 1Nearest Neighbor 1NN on both the training and test sets is 8 ... View the full answer
Related Book For
Research Methods For Business Students
ISBN: 9781292208787
8th Edition
Authors: Mark Saunders, Philip Lewis, Adrian Thornhill
Posted Date:
Students also viewed these programming questions
-
What is the legal basis for protecting employees from harassment with supporting laws or regulations explain in detail.
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
Managing Scope Changes Case Study Scope changes on a project can occur regardless of how well the project is planned or executed. Scope changes can be the result of something that was omitted during...
-
Give the approximate temperature at which creep deformation becomes an important consideration for each of the following metals: nickel, copper, iron, tungsten, lead, and aluminum.
-
For the weldment of Prob. 91 the electrode specified is E7010. For the electrode metal, what is the allowable load on the weldment?
-
Hamlet acquires a 7-year class asset on November 23, 2015, for $100,000. Hamlet does not elect immediate expensing under 179. He does not claim any available additional first-year depreciation....
-
The altitudes (in kilometers) of atmosphere at which helium is found in majority in 10 different cities are listed. 938.5 927.0 929.5 930.3 934.3 936.0 926.2 930.5 924.8 870.7 (a) Find the range of...
-
Kim Panenka asked to borrow $ 4,750 from her sister, Kris, to make a mortgage payment. Kris deposited a check for that amount into Kims bank account. Hours later, Kim asked to borrow another $ 1,100....
-
Criminal possession of a weapon in the second degree. A person is guilty of criminal possession of a weapon in the second degree when: (1) with intent to use the same unlawfully against another, such...
-
Consider the state diagrams of Figure 12.28. a. Describe the behavior of each. b. Compare these with the branch prediction state diagram in Section 12.4. Discuss the relative merits of each of the...
-
The yearly load duration curve of a certain power station can be approximated as a straight line; the maximum and minimum loads being 100 MW and 60 MW respectively. To meet this load, three...
-
A 51-kg packing crate is pulled across a rough floor with a rope that is at an angle of 43 above the horizontal. If the tension in the rope is 120 N, how much work is done on the crate to move it 18...
-
How is income generated from the investment of an endowment treated under the Restricted Fund method when it is restricted?
-
The UCC balances as at January 1, 2021, are: Class 1: $5,000,000 Class 8: $3,200,000 Class 10: $400,000 Class 12: nil12. Deco has the following capital asset additions and disposals during the year:...
-
After falling from rest from a height of 30 m, a 0.50-kg ball rebounds upward, reaching a height of 20 m. If the contact between ball and ground lasted 2.0 ms, what average force was exerted on the...
-
A syringe has an area of 0.8 cm2 at its barrel and then narrows down to an area of 0.11 mm2 at the needle end. If a force of 5.1 N is applied to the syringe, what is the force produced at the tip of...
-
Example 2. Find general solution for y" - 3y + 4y = 2e-t.
-
An example of prescriptive analytics is when an action is recommended based on previously observed actions. For example, an analysis might help determine procedures to follow when new accounts are...
-
Visit an online database or your university library and obtain a copy of an article that you think will be of use to an assignment you are both currently working on. Use the checklist in Box 3.13 to...
-
Visit the CAQDAS websites listed in Table 13.2 . Using the information available on each website, explore the suitability of each program for the nature of your data and chosen approach to analyse...
-
For each of the following research questions it has not been possible for you to obtain a sampling frame. Suggest the most suitable non-probability sampling technique to obtain the necessary data,...
-
The NCAA is described as a cartel. In what way is it a cartel? What is the product being produced? How does the cartel stay together?
-
What is the cost to a firm in an oligopoly that fails to take rivals actions into account?
-
The cement industry is an example of an undifferentiated oligopoly. The automobile industry is a differentiated oligopoly. Which of these two is more likely to advertise? Why?
Study smarter with the SolutionInn App