Question: Suppose we have a data set {(x1, y),..., (2N, yn)}. Xi's and labels yi's are both binary: i.e., Li, Yi E {0, 1} for all

Suppose we have a data set {(x1, y),..., (2N, yn)}. Xi's

Suppose we have a data set {(x1, y),..., (2N, yn)}. Xi's and labels yi's are both binary: i.e., Li, Yi E {0, 1} for all i. We know the generating process of this dataset: for each (Li, Yi), first label yi are generated from the Bernoulli distribution: Yi ~ Bernoulli(1/2). In other words, Pr(yi = 1) = Pr(yi = 0) = 1/2. Then, if yi = 1, then Ti~ Bernoulli(p), if yi = 0, then Xi~ Bernoulli(q). In other words, Pr(xi = 1 | Yi = 1) =p, Pr(di = 1 | Yi = 0) = q. Suppose p > q, we would like to find the Bayes optimal classifier f* :X + Y, which predicts label yi based on ti. (i) (Points: 10) What is the Bayes optimal classifier f*(x)? (ii) (Points: 10) Prove that the classifier has minimal risk among all deterministic classifiers. Suppose we have a data set {(x1, y),..., (2N, yn)}. Xi's and labels yi's are both binary: i.e., Li, Yi E {0, 1} for all i. We know the generating process of this dataset: for each (Li, Yi), first label yi are generated from the Bernoulli distribution: Yi ~ Bernoulli(1/2). In other words, Pr(yi = 1) = Pr(yi = 0) = 1/2. Then, if yi = 1, then Ti~ Bernoulli(p), if yi = 0, then Xi~ Bernoulli(q). In other words, Pr(xi = 1 | Yi = 1) =p, Pr(di = 1 | Yi = 0) = q. Suppose p > q, we would like to find the Bayes optimal classifier f* :X + Y, which predicts label yi based on ti. (i) (Points: 10) What is the Bayes optimal classifier f*(x)? (ii) (Points: 10) Prove that the classifier has minimal risk among all deterministic classifiers

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Suppose we have a data set {(x1,y1),..., (XN, YN)}. xi's and labels yi's are both binary: i.e., xi, Yi {0, 1} for all i. We know the generating process of this dataset: for each (xi, Yi), first label...

Exercises Chapter 2 2.1 Marginal and conditional probability: The social mobility data from Section 2.5 gives a joint probability distribution on (Y1 , Y2 )= (father's occupation, son's occupation)....

Stat-3503/Stat-8109 Airoldi/Fall-21 problem set no. 1 due monday 10/25 before lecture starts learning objectives. compute likelihoods, both for a generic sample, i.e., (x1, ..., xn), and for a...

Least Squares Approximation Hector D. Ceniceros 1 Least Squares Approximation Let f be a continuous function on [a, b]. We would like to find the best approximation to f by a polynomial of degree at...

BA 1605: Midterm Recap (Due: Feb. 27, 2015) Name _____________________________ 50 Student ID _____________________________ Section 01B 10:00~11:20 am Section 02B 01:00~02:20 pm [Questions 4 ~ 7] The...

dee complete please help Complexity Theory (a) Defifine the set of Boolean expressions 2CNF and the language 2SAT over them. (b) For a Boolean expression in 2CNF, let G() be the directed graph with...

Suppose you have bivariate data (x1, y1), . . . ,(xn, yn). A common model is that there is a linear relationship between x and y, so in principle the data should lie exactly along a line. However...

Problem 2.(BONUS](20 points) In this problem we aim at generalizing the Logistic Re- gression algorithm to multi-class classification problem, the setting where the label space includes three or more...

ST332 & ST409 Medical Statistics 2014-15: Exercises 2 1) Clinical Trial Design (based on an old exam question) A new drug against AIDS has been developed that may be useful in conjunction with the...

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

2. Which of the following would you expect to have the highest boiling point: CH;CH,CH3 CH;OH, CH;CH2OH , CH3COOH ? Explain your answer. (4 Marks) 3. Outline the type of hybridation in Ethanoic acid...

The following data represent the Diagnosis of a random sample of 20 patients admitted to a hospital. Determine the mode diagnosis. Motor vehicle accident Fall Motor vehicle accident Motor vehicle...

Seloct the secenario that is an evample of a legal financial interwecliation. Btna giving Sid $ 1 0 , 0 0 0 to purchase a used car Marilyw encouraging Howand to get tinancial atvice before purchasing...

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

a. How are members selected to join the team?

b. Will new members be welcomed?

c. Will leaders rotate periodically?