Question: 3 (15). You are given a dataset for a 2-class classification problem. You try three methods on it: minimum distance classifier, perceptron, and k-nearest neighbors. You try each algorithm in different ways (e.g., changing k, changing learning rates, etc.) until you can get no further improvement. The best overall errors you get in each case are:

* Minimum Distance Classifier: J = 0.4
* Perceptron: J = [value illegible in the source]
* k-Nearest Neighbors: J = 0.05

All errors are from a normalized range between 0 and 1, so J = 0.4 is 40% error. Assume that each algorithm is used in its standard form and with raw data (i.e., no variable k, no scaling of data, etc.).

a) (10 points) Based only on these errors, what can you say about the distribution of the data for the two classes in feature space? Give a bullet list of all the significant things you can think of, and in each case explain why you think that based on the errors you got.

b) (5 points) Assuming a 2-dimensional feature space, draw an approximate picture that illustrates your points about how the data for the two classes is distributed.

Remember, there is no absolute "right" or "wrong" answer here. You are speculating, but your conjectures must be plausible and justifiable. You will get points for plausible conjectures, and lose points for implausible ones and those you cannot explain.
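One way to explore a conjecture for part (a) is to build a small synthetic dataset and see how the three classifiers behave on it. The sketch below is only an illustration under stated assumptions: the XOR-like cluster layout, the function names, the leave-one-out evaluation for k-NN, and the perceptron settings (50 epochs, learning rate 1.0) are all hypothetical choices, not the dataset or procedure from the question. It uses only the Python standard library.

```python
import itertools


def make_xor_clusters():
    """Hypothetical 2-D data: class 0 near (0,0) and (4,4), class 1 near (0,4) and (4,0)."""
    offsets = [-0.5, 0.0, 0.5]
    centers = {(0.0, 0.0): 0, (4.0, 4.0): 0, (0.0, 4.0): 1, (4.0, 0.0): 1}
    points, labels = [], []
    for (cx, cy), label in centers.items():
        # 3 x 3 grid of points around each cluster centre (9 points per cluster).
        for dx, dy in itertools.product(offsets, offsets):
            points.append((cx + dx, cy + dy))
            labels.append(label)
    return points, labels


def sq_dist(a, b):
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def error_rate(predicted, labels):
    """Fraction of misclassified points, in the range 0 to 1."""
    return sum(p != y for p, y in zip(predicted, labels)) / len(labels)


def min_distance_error(points, labels):
    """Minimum distance classifier: assign each point to the nearest class mean."""
    means = []
    for c in (0, 1):
        members = [p for p, y in zip(points, labels) if y == c]
        means.append((sum(p[0] for p in members) / len(members),
                      sum(p[1] for p in members) / len(members)))
    # Ties go to class 0 (an arbitrary choice made for this sketch).
    predicted = [0 if sq_dist(p, means[0]) <= sq_dist(p, means[1]) else 1
                 for p in points]
    return error_rate(predicted, labels)


def perceptron_error(points, labels, epochs=50, lr=1.0):
    """Standard perceptron rule with a bias term; returns the final training error."""
    w1, w2, b = 0.0, 0.0, 0.0
    for _ in range(epochs):
        for (x1, x2), y in zip(points, labels):
            target = 1 if y == 1 else -1
            if target * (w1 * x1 + w2 * x2 + b) <= 0:  # misclassified: update
                w1 += lr * target * x1
                w2 += lr * target * x2
                b += lr * target
    predicted = [1 if w1 * x1 + w2 * x2 + b > 0 else 0 for x1, x2 in points]
    return error_rate(predicted, labels)


def knn_loo_error(points, labels, k=3):
    """k-NN with leave-one-out evaluation: each point is classified by the others."""
    predicted = []
    for i, p in enumerate(points):
        others = sorted((sq_dist(p, q), labels[j])
                        for j, q in enumerate(points) if j != i)
        votes_for_1 = sum(label for _, label in others[:k])
        predicted.append(1 if 2 * votes_for_1 > k else 0)
    return error_rate(predicted, labels)


if __name__ == "__main__":
    pts, ys = make_xor_clusters()
    print("minimum distance:", min_distance_error(pts, ys))
    print("perceptron:      ", perceptron_error(pts, ys))
    print("3-NN (LOO):      ", knn_loo_error(pts, ys, k=3))
```

In this particular layout the two class means coincide, so the minimum distance classifier is at chance level, and no straight line can get more than three of the four clusters right, so the perceptron's error cannot drop below 0.25. Because the clusters are tight and well separated from each other, leave-one-out 3-NN makes no errors. A layout that reproduces the specific error values in the question would need a different, less symmetric arrangement.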
