
Question: Problem 3. Recall that in classification we assume that each data point is an i.i.d. sample from an (unknown) distribution P(X = x, Y = y). In this question, we are going to design the data distribution P and evaluate the performance of logistic regression on data generated using P. Keep in mind that we would like to make P as simple as we can. In the following, we assume x ∈ R and y ∈ {0, 1}, i.e. the data is one-dimensional and the label is binary. Write P(X = x, Y = y) = P(X = x) P(Y = y | X = x). We will generate X = x according to the uniform distribution on the interval [0, 1] (thus P(X = x) is just the pdf of the uniform distribution).

1. Design P(Y = y | X = x) such that (i) P(Y = 0) = P(Y = 1) = 0.5; (ii) the classification accuracy of any classifier is at most 0.9; and (iii) the accuracy of the Bayes optimal classifier is at least 0.8.
2. Using Python, generate n = 100 training data points according to the distribution you designed above and train a binary classifier using logistic regression on the training data.
3. Generate n = 100 test data points according to the distribution you designed in part 1 and compute the prediction accuracy (on the test data) of the classifier you trained in part 2. Also, compute the accuracy of the Bayes optimal classifier on the test data. Why do you think the Bayes optimal classifier performs better?
4. Redo parts 2 and 3 with n = 1000. Are the results any different than in part 3? Why?
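
Below is a minimal sketch of how parts 1 through 4 could be carried out, assuming NumPy and scikit-learn are available. The conditional distribution P(Y = 1 | X = x) = 0.85 for x > 0.5 and 0.15 otherwise is just one illustrative choice that meets the constraints in part 1: it gives P(Y = 0) = P(Y = 1) = 0.5, caps any classifier's accuracy at 0.85 (which is at most 0.9), and gives the Bayes optimal rule (predict 1 iff x > 0.5) accuracy 0.85 (which is at least 0.8).

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def sample(n):
    # X ~ Uniform[0, 1]; illustrative choice: P(Y=1 | x) = 0.85 if x > 0.5, else 0.15
    x = rng.uniform(0.0, 1.0, size=n)
    p1 = np.where(x > 0.5, 0.85, 0.15)
    y = (rng.uniform(size=n) < p1).astype(int)
    return x.reshape(-1, 1), y

for n in (100, 1000):
    X_train, y_train = sample(n)
    X_test, y_test = sample(n)

    # Train logistic regression on the training data, evaluate on the test data
    clf = LogisticRegression().fit(X_train, y_train)
    lr_acc = clf.score(X_test, y_test)

    # Bayes optimal classifier for this P: predict 1 exactly when x > 0.5
    bayes_pred = (X_test[:, 0] > 0.5).astype(int)
    bayes_acc = (bayes_pred == y_test).mean()

    print(f"n={n}: logistic regression accuracy={lr_acc:.3f}, "
          f"Bayes optimal accuracy={bayes_acc:.3f}")
```

With n = 1000 one would typically expect the logistic regression accuracy to move closer to the Bayes optimal accuracy, since the fitted decision threshold approaches 0.5 as more data is seen.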
