Q1) Discriminant Linear Classifiers:

You are given a training data set {xn, tn} of size N = 21. Each input vector xn is a point in the 2-dimensional Euclidean space R^2. We have x1 = (0, 0), x2 = (1, 0), x3 = (2, 0), x4 = (0, 1), x5 = (1, 1), x6 = (2, 1), x7 = (3, 1), x8 = (4, 1), x9 = (5, 1), x10 = (100, 1), x11 = (0, 2), x12 = (1, 2), x13 = (2, 2), x14 = (3, 2), x15 = (4, 2), x16 = (5, 2), x17 = (100, 2), x18 = (3, 3), x19 = (4, 3), x20 = (5, 3), and x21 = (100, 3).

There are two target classes C1 and C2. For each point xn in the training set, xn belongs to C1 if its second coordinate is less than or equal to 2, and belongs to C2 otherwise. If xn ∈ C1, we have tn = 1. If xn ∈ C2, we have tn = 0 in the questions regarding the least-squares linear discriminant and Fisher's linear discriminant, and tn = −1 in the question on the perceptron algorithm.

(A) Compute the least-square linear classifier based on the training data. You need to write out (a) the error function, (b) the computed parameters (w0, w1, w2), and (c) plot the classification together with the training data.
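A minimal sketch of part (A), assuming the standard sum-of-squares error E(w) = (1/2) Σn (wᵀφ(xn) − tn)² over the augmented inputs φ(xn) = (1, xn1, xn2) and a decision threshold of 1/2; the variable names (`Phi`, `w`, `pred`) are mine, not from the problem:

```python
import numpy as np

# Training data from the problem statement (21 points in R^2)
X = np.array([(0, 0), (1, 0), (2, 0), (0, 1), (1, 1), (2, 1), (3, 1), (4, 1),
              (5, 1), (100, 1), (0, 2), (1, 2), (2, 2), (3, 2), (4, 2), (5, 2),
              (100, 2), (3, 3), (4, 3), (5, 3), (100, 3)], dtype=float)
t = (X[:, 1] <= 2).astype(float)          # tn = 1 for C1 (y <= 2), 0 for C2

# Augmented design matrix: a column of ones for w0, then the raw coordinates
Phi = np.hstack([np.ones((len(X), 1)), X])

# Minimize the sum-of-squares error; lstsq solves the normal equations
w, *_ = np.linalg.lstsq(Phi, t, rcond=None)

# Classify as C1 where the linear output exceeds the midpoint of the targets
pred = (Phi @ w >= 0.5).astype(float)
```

Note the outliers at x = 100 pull the least-squares fit strongly, which is worth commenting on when plotting the resulting decision boundary against the data.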

(B) Compute the linear classifier based on the training data using Fisher's linear discriminant. You need to write out (a) the error function, (b) the computed parameters (w0, w1, w2), and (c) plot the classification together with the training data.
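A sketch of part (B) using the usual Fisher construction: the projection direction is w ∝ Sw⁻¹(m1 − m2), where m1, m2 are the class means and Sw is the within-class scatter matrix. Placing the bias w0 at the projected midpoint of the two means is one common choice, not the only one; all variable names here are mine:

```python
import numpy as np

X = np.array([(0, 0), (1, 0), (2, 0), (0, 1), (1, 1), (2, 1), (3, 1), (4, 1),
              (5, 1), (100, 1), (0, 2), (1, 2), (2, 2), (3, 2), (4, 2), (5, 2),
              (100, 2), (3, 3), (4, 3), (5, 3), (100, 3)], dtype=float)
C1 = X[X[:, 1] <= 2]                       # 17 points with second coordinate <= 2
C2 = X[X[:, 1] > 2]                        # 4 points with second coordinate 3

m1, m2 = C1.mean(axis=0), C2.mean(axis=0)  # class means
# Within-class scatter: sum of the two per-class scatter matrices
Sw = (C1 - m1).T @ (C1 - m1) + (C2 - m2).T @ (C2 - m2)

w = np.linalg.solve(Sw, m1 - m2)           # Fisher direction Sw^{-1}(m1 - m2)
w0 = -w @ (m1 + m2) / 2                    # threshold at the projected midpoint

pred_C1 = X @ w + w0 > 0                   # True where a point is assigned to C1
```

Since Sw is positive definite here, w·(m1 − m2) > 0 is guaranteed, so the two class means always land on opposite sides of this threshold.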

(C) Compute the linear classifier based on the training data using the perceptron algorithm, starting with the initial parameter (w0, w1, w2) = (1.5, 0, 0). For each iteration, you need to specify (a) the iteration number, (b) the current parameters, (c) the mis-classified input xn used in that particular iteration of stochastic gradient descent, and (d) the updating vector. When the algorithm converges, plot the classification together with the training data.
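A sketch of part (C). The perceptron update for a misclassified point is w ← w + tn φ(xn) with tn ∈ {+1, −1}. The question asks for a per-iteration trace; the loop below records the iteration count and could be extended to print (w, xn, update) each step. Always picking the lowest-index misclassified point is one convention for the stochastic pass; the problem does not fix the order, so a different convention gives a different (still valid) trace:

```python
import numpy as np

X = np.array([(0, 0), (1, 0), (2, 0), (0, 1), (1, 1), (2, 1), (3, 1), (4, 1),
              (5, 1), (100, 1), (0, 2), (1, 2), (2, 2), (3, 2), (4, 2), (5, 2),
              (100, 2), (3, 3), (4, 3), (5, 3), (100, 3)], dtype=float)
t = np.where(X[:, 1] <= 2, 1.0, -1.0)      # tn = +1 for C1, -1 for C2
Phi = np.hstack([np.ones((len(X), 1)), X])

w = np.array([1.5, 0.0, 0.0])              # initial parameters from the problem
it = 0
while it < 10**6:                          # safety cap; data is separable (y < 2.5)
    mis = np.where(np.sign(Phi @ w) != t)[0]   # currently misclassified indices
    if len(mis) == 0:
        break                              # converged: every point correct
    n = mis[0]                             # lowest-index misclassified point
    w = w + t[n] * Phi[n]                  # perceptron update: w += tn * phi(xn)
    it += 1
```

Because the large coordinates (x = 100) make the data radius R big relative to the margin, the convergence bound (R/γ)² is large, so the trace can run for many iterations even though convergence is guaranteed.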

Q2). Continuous Bayes Classifier:

We want to build a Bayes classifier for a binary classification task (y = 1 or y = 2) with a 1-dimensional input feature (x). We know the following quantities: (1) P(y = 1) = 0.6; (2) P(x|y = 1) = 0.5 for 0 ≤ x ≤ 2 and P(x|y = 1) = 0 otherwise; and (3) P(x|y = 2) = 0.125 for 0 ≤ x ≤ 8 and P(x|y = 2) = 0 otherwise.

(A) What is the prior for class label y = 2?

(B) What is P(y = 1|x)?

(C) For x = 1, what is the class label your classifier will assign? Why? What is the risk of this decision?

(D) What is the decision boundary of your Bayes classifier?
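Parts (B)–(D) all reduce to one Bayes'-rule computation, P(y = 1|x) = P(x|y = 1)P(y = 1) / [P(x|y = 1)P(y = 1) + P(x|y = 2)P(y = 2)], evaluated piecewise over the two supports. A small numeric check, with function and variable names of my choosing:

```python
def posterior_y1(x):
    """P(y = 1 | x) by Bayes' rule, using the densities from the problem."""
    p_x_y1 = 0.5 if 0 <= x <= 2 else 0.0      # uniform on [0, 2]
    p_x_y2 = 0.125 if 0 <= x <= 8 else 0.0    # uniform on [0, 8]
    num = 0.6 * p_x_y1                        # P(x|y=1) P(y=1)
    den = num + 0.4 * p_x_y2                  # total evidence P(x)
    return num / den if den > 0 else None     # undefined outside both supports

# On [0, 2] both densities are positive, so the posterior is constant:
post = posterior_y1(1)   # 0.3 / 0.35 = 6/7, about 0.857
```

At x = 1 the classifier picks y = 1 (posterior 6/7 > 1/2) with risk 1 − 6/7 = 1/7; for 2 < x ≤ 8 the posterior drops to 0, so x = 2 is the decision boundary.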

Q3). Discrete Bayes Classifier:

We want to build a Bayes classifier for a binary classification task (y = 1 or y = 2) with two binary features (x1 and x2). We know the following quantities: (1) P(y = 1) = 0.6; (2) P(x1 = 0, x2 = 0|y = 1) = 0.3, P(x1 = 0, x2 = 1|y = 1) = 0.1, P(x1 = 1, x2 = 0|y = 1) = 0.4, P(x1 = 1, x2 = 1|y = 1) = 0.2; and (3) P(x1 = 0, x2 = 0|y = 2) = 0.4, P(x1 = 0, x2 = 1|y = 2) = 0.3, P(x1 = 1, x2 = 0|y = 2) = 0.2, P(x1 = 1, x2 = 1|y = 2) = 0.1.

(A) What is the prior for class label y = 2?

(B) What is P(y = 1|x)?

(C) For an example with x1 = 0 and x2 = 1, what is the class label your classifier will assign? Why? What is the risk of this decision?

(D) What is the decision boundary of your Bayes classifier?
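Since the feature space has only four points, parts (B)–(D) can be answered by tabulating the posterior P(y|x1, x2) = P(y) P(x1, x2|y) / P(x1, x2) for each combination. A sketch, with my own container and function names:

```python
# Priors and class-conditional tables from the problem statement
prior = {1: 0.6, 2: 0.4}
lik = {  # P(x1, x2 | y), keyed by (x1, x2)
    1: {(0, 0): 0.3, (0, 1): 0.1, (1, 0): 0.4, (1, 1): 0.2},
    2: {(0, 0): 0.4, (0, 1): 0.3, (1, 0): 0.2, (1, 1): 0.1},
}

def posterior(y, x):
    """P(y | x1, x2) by Bayes' rule over the two classes."""
    evidence = sum(prior[c] * lik[c][x] for c in (1, 2))
    return prior[y] * lik[y][x] / evidence

# MAP decision at each of the four feature combinations
decision = {x: max((1, 2), key=lambda y: posterior(y, x)) for x in lik[1]}
```

For (x1, x2) = (0, 1) the posterior of y = 1 is 0.06 / 0.18 = 1/3, so the classifier assigns y = 2 with risk 1/3; the "decision boundary" here is just the partition of the four points into the two predicted classes.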
