Question: You are given the 1 6 training data points on the figure, denoted by + and - symbols, along with 3 test data points, denoted

You are given the 16 training data points on the figure, denoted by + and - symbols, along
with 3 test data points, denoted by squares (for which you have no labels). The axes intention-
ally have no tick marks, but you can assume that the scale of the x1 and x2 axes are the same.
(a) The closest average classifier has a high training
error rate. Explain why this is the case.
(b) Is there a linear classifier of the form Dlinear(x1,x2)={+1,a1x1+a2x20-1,a1x1+a2x20
that can attain training error rate less than 14? Justify your answer.
(c) Consider a non-linear classifier of the form Dnon-linear(x1,x2)={+1,g(x1,x2)-1,g(x1,x2)>
where the function g(x1,x2) depends only on x1,x2 while the threshold can depend on
the training data. Select a good function g(x1,x2) for this scenario. Justify your answer.
(d) Write a detailed procedure for selecting the threshold from part (c) using the training
data. Your answer should be written explicitly in terms of ({:xtrain,1,1,xtrain,1,2,Ytrain,1) for
i=1,dots,ntrain(but it does not need to be MATLAB/Python code or pseudo-code).
(e) Sketch the decision boundary for your trained non-linear classifier from parts (c) and (d)
on the figure above. In each test data square, write the label selected by your classifier.
 You are given the 16 training data points on the figure,

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!