Question:

1. (30 points) Suppose we use a linear SVM classifier for a binary classification problem with a set of data points shown in Figure 1, where the samples closest to the boundary are illustrated. Samples with positive labels are (0, 1), (-2, -1), (1, 3), (-1, 1.5), (-1.5, 0.5), and samples with negative labels are (0.5, -0.5), (-0.5, -2), (1.5, -1), (3, 0.5), (-1, -2.5).

[Figure 1: A binary classification problem where positive samples are shown in red and negative ones in blue. The solid line X1 - X2 = 0 is the decision boundary, and the dashed lines X1 - X2 = 1 and X1 - X2 = -1 define the margins.]

(a) List the support vectors.

(b) Pick three samples and calculate their distances to the hyperplane X1 - X2 = 0.

(c) If the sample (0, 1) is removed, will the decision boundary change? What if we remove the sample (-1, -2.5) instead?

(d) If a new sample (2, 0.5) comes in as a positive sample, will the decision boundary change? If so, what method will you use in this case?

(e) In the soft margin SVM method, C is a hyperparameter (see Eqs. 13.10 or 13.11 in Chapter 13.3 of the textbook). What would happen when you use a very large value of C? How about a very small one?

(f) In real-world applications, how would you decide which SVM method to use (hard margin vs. soft margin, linear vs. kernel)?
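As a quick numerical check for parts (b) and (e), here is a minimal Python sketch. It is my own illustration, not part of the question, and it assumes NumPy and scikit-learn are available. It applies the point-to-hyperplane distance formula d = |w·x + b| / ||w|| with w = (1, -1) and b = 0 for the boundary X1 - X2 = 0, then fits soft-margin linear SVMs at several C values to show how the margin width 2/||w|| responds.

```python
import numpy as np
from sklearn.svm import SVC

# Samples from the question: positives labeled +1, negatives labeled -1.
X_pos = np.array([[0, 1], [-2, -1], [1, 3], [-1, 1.5], [-1.5, 0.5]])
X_neg = np.array([[0.5, -0.5], [-0.5, -2], [1.5, -1], [3, 0.5], [-1, -2.5]])
X = np.vstack([X_pos, X_neg])
y = np.array([1] * len(X_pos) + [-1] * len(X_neg))

# Part (b): the distance from a point x to the hyperplane w.x + b = 0
# is |w.x + b| / ||w||.  For X1 - X2 = 0, w = (1, -1) and b = 0.
w, b = np.array([1.0, -1.0]), 0.0
for point in [(0, 1), (0.5, -0.5), (3, 0.5)]:  # any three samples will do
    d = abs(np.dot(w, point) + b) / np.linalg.norm(w)
    print(f"distance from {point} to X1 - X2 = 0: {d:.4f}")

# Part (e): a large C penalizes margin violations heavily (behavior
# approaches hard margin); a small C tolerates violations for a wider margin.
for C in [0.01, 1.0, 100.0]:
    clf = SVC(kernel="linear", C=C).fit(X, y)
    margin = 2.0 / np.linalg.norm(clf.coef_)  # margin width is 2 / ||w||
    print(f"C={C}: support vectors={len(clf.support_)}, margin={margin:.3f}")
```

For instance, the first printed distance should be |0 - 1| / sqrt(2) = 1/sqrt(2), about 0.707, for the sample (0, 1).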
