Question:

Please do not change the variables that are already written in the questions or the seed (if any is given).

All code must be executable from scratch. Code should be written properly (i.e. put code in functions as needed, declare variables as needed, and don't repeat yourself). Only use functions from the packages loaded in the first block of code below for this problem set.

Make sure all plots contain appropriately labelled axes and are easy to read and interpret.


- Part one (simulate and recover)
  - draw random 2D X samples and augment this matrix with a column of ones
  - specify three known theta values (an intercept and two slopes)
  - multiply X by theta and put the result through the logistic function
  - from the resulting probabilities, generate 0/1 observations (the y values)
  - with a full simulated X and y dataset, code a log-likelihood function for the model given some data
  - optimize (verify you can recover the true parameters by optimizing the log-likelihood)

Importing Packages

```python
import math
import matplotlib.pyplot as plt
import numpy as np
import scipy
import scipy.stats
from scipy import stats
import pandas as pd
import csv
import seaborn as sns
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegressionCV
```

Logistic Regression

Logistic regression is a statistical method used to model the relationship between a binary dependent variable (e.g. 0 or 1, yes or no) and one or more independent variables. The goal of logistic regression is to find the best-fitting curve that represents the non-linear relationship between the dependent variable and the independent variables. Logistic regression takes the form:

$$f(x) = \frac{e^{x}}{1 + e^{x}}$$

Part a

Knowing the equation for logistic regression, let's now make a function called logistic that takes in x and returns an output between 0 and 1.

```python
def logistic(x):
    ##############################
    # write your code below
```

Part b

Now call your function with some x values (say that x will range from -8 to 8) and plot the graph of the outputs. Hint: remember that your output should be between 0 and 1.

```python
##############################
# write your code below
```
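For reference, here is a minimal sketch of what Parts a and b might look like (one possible implementation, not the only acceptable one; the number of plotting points and the variable name x_vals are assumptions). It relies only on numpy and matplotlib from the imports above:

```python
def logistic(x):
    # Map any real-valued input to a value in (0, 1)
    return np.exp(x) / (1 + np.exp(x))

# Part b: plot the logistic curve over x in [-8, 8]
x_vals = np.linspace(-8, 8, 200)
plt.plot(x_vals, logistic(x_vals))
plt.xlabel("x")
plt.ylabel("logistic(x)")
plt.title("Logistic function")
plt.show()
```

Note that np.exp(x) / (1 + np.exp(x)) can overflow for very large x; over the range [-8, 8] it is well behaved.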
Looking back to the Bernoulli Distribution

Recall that the Bernoulli distribution is a probability distribution that models the outcome of a single binary event, such as a coin toss or the presence of heart disease. We can view the probability of the event occurring as $p$ and the probability of the event not occurring as $1-p$. Thus, the event has two possible outcomes (usually viewed as success or failure), occurring with probability $p$ and $1-p$, respectively:

$$p(y) = \begin{cases} p, & y = 1 \\ 1 - p, & y = 0 \end{cases}$$

The above function can be simplified to a single line as follows:

$$p(y) = p^{y}(1-p)^{1-y}$$

We obtain the probability mass function, which is a function that gives the probability that a discrete random variable is exactly equal to some value. It takes the form:

$$p = P(y = 1 \mid x) = \frac{e^{\theta^\top x}}{1 + e^{\theta^\top x}}$$

where we can assume the parameters $\theta$ for the logistic regression. Remember, here it makes sense to formulate the problem such that $\theta = [\theta_0, \theta_1, \theta_2, \ldots]$, where the first theta value is the intercept, and $x$ is augmented appropriately into a design matrix by appending a one. The maximum likelihood estimator uses the logistic function to estimate this probability.

Once we plug the logistic probability mass function into the Bernoulli form, we get:

$$p(y) = p^{y}(1-p)^{1-y} = \left(\frac{e^{\theta^\top x_i}}{1 + e^{\theta^\top x_i}}\right)^{y}\left(\frac{1}{1 + e^{\theta^\top x_i}}\right)^{1-y} = \frac{e^{y(\theta^\top x_i)}}{1 + e^{\theta^\top x_i}}$$

But this is only for a single observation, and we are interested in a whole set of data:

$$L(Y_1, Y_2, \ldots, Y_n, \theta) = \prod_{i=1}^{n} p_i^{y_i}(1-p_i)^{1-y_i}$$

To simplify this formula, we can take the natural log of the function, thus converting multiplication to summation:

$$\ell(Y_1, Y_2, \ldots, Y_n, \theta) = \sum_{i=1}^{n} y_i \log p_i + \sum_{i=1}^{n} (1-y_i)\log(1-p_i)$$

Then we plug in the $p(y)$ formula we had before:

$$\ell(Y_1, Y_2, \ldots, Y_n, \theta) = \sum_{i=1}^{n} y_i \log\frac{e^{\theta^\top x_i}}{1 + e^{\theta^\top x_i}} + \sum_{i=1}^{n} (1-y_i)\log\left(\frac{1}{1 + e^{\theta^\top x_i}}\right)$$

After simplification, we get:

$$\ell(Y_1, Y_2, \ldots, Y_n, \theta) = \sum_{i=1}^{n} y_i(\theta^\top x_i) - \sum_{i=1}^{n} \log\left(1 + e^{\theta^\top x_i}\right)$$

This is the simplified logistic regression log-likelihood under maximum likelihood estimation.

Now let's simulate some sample x data and Bernoulli outcomes. Here, the x data will have 2 features. To simulate the x data, draw a 2-by-npoints matrix of random values from a standard normal distribution, where npoints is the number of samples. Then declare a vector of known parameters, which includes a coefficient for each of the two x features (per datapoint) and an offset. To account for the offset parameter, augment the matrix with a column of ones so you can easily perform vector multiplication on the x data. Then, using the logistic function from above, multiply the known parameters by the x data, put the result through the logistic function, and generate some ps (predicted probabilities). Finally, using the ps we generated, sample some ys from a binomial distribution where the probability of success equals ps. Ultimately, you should have just as many sampled ys as npoints. (A sketch of this step appears after Part g below.)

```python
#### below we will specify the vector of known parameters, and the number of points
######## generate fake data ###########
np.random.seed(0)
w1 = .2
w2 = .8
w3 = .4
params = [w1, w2, w3]  # this is the part we know because we are simulating the data, but in general would not know
npoints = 5000
###################################################
# write your code below
```

Part d

Plot a histogram of all of the probabilities. Confirm that all the ps are between zero and one.

```python
np.random.seed(0)
#############################################
# write your code below
```

```python
np.random.seed(0)
#############################################
# write your code below
```

Recall, the logistic log-likelihood is given by:

$$\ell(Y_1, Y_2, \ldots, Y_n, \theta) = \sum_{i=1}^{n} y_i(\theta^\top x_i) - \sum_{i=1}^{n} \log\left(1 + e^{\theta^\top x_i}\right)$$

Part f

Using the formula above, define a function called logistic_ll that takes in 3 parameters: params, design_matrix, and sample_ys, and returns the negative log-likelihood.

```python
### run logistic regression using MLE
import scipy as sp

def logistic_ll(params, design_matrix, sample_ys):
    #############################################
    # write your code below
```

Part g

Using the above log-likelihood function, use a lambda function to make the function an argument of the parameters only. Then run scipy.optimize.minimize with some randomly declared initial parameters. Show that the optimal inferred parameters are close to the true generative parameters. (Sketches of the simulation, log-likelihood, and optimization steps follow below.)
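A minimal sketch of the simulation step and the Part d histogram, assuming the params/npoints cell above has run and the logistic function from Part a is defined. Variable names such as x_data, design_matrix, ps, and sample_ys are illustrative assumptions, and the matrix is built here as npoints-by-2 (the transpose of the 2-by-npoints description) so that each row is one datapoint:

```python
np.random.seed(0)
x_data = np.random.randn(npoints, 2)                        # npoints samples, 2 standard-normal features
design_matrix = np.hstack([np.ones((npoints, 1)), x_data])  # column of ones first, matching the intercept w1
ps = logistic(design_matrix @ np.array(params))             # predicted probability for each datapoint
sample_ys = np.random.binomial(1, ps)                       # one 0/1 Bernoulli outcome per datapoint

# Part d: histogram of the probabilities, which should all lie in [0, 1]
plt.hist(ps, bins=30)
plt.xlabel("predicted probability p")
plt.ylabel("count")
plt.title("Histogram of simulated probabilities")
plt.show()
print("all ps in [0, 1]:", bool(ps.min() >= 0 and ps.max() <= 1))
```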
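A sketch of logistic_ll for Part f, translating the simplified log-likelihood directly into numpy (a plain translation of the formula; for very large |θᵀx| the np.exp call can overflow, which a production version would guard against):

```python
def logistic_ll(params, design_matrix, sample_ys):
    # z_i = theta^T x_i for every datapoint at once
    z = design_matrix @ np.asarray(params)
    # ll = sum_i y_i * z_i - sum_i log(1 + e^{z_i})
    ll = np.sum(sample_ys * z) - np.sum(np.log(1 + np.exp(z)))
    return -ll  # negative log-likelihood, so a minimizer recovers the MLE
```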
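And a sketch of the Part g optimization (the explicit `from scipy import optimize` and the size-3 random starting point are assumptions; any reasonable start should recover parameters close to the true [.2, .8, .4]):

```python
from scipy import optimize

# Freeze the data arguments so the objective is a function of the parameters only
neg_ll = lambda p: logistic_ll(p, design_matrix, sample_ys)

init_params = np.random.randn(3)  # randomly declared initial parameters
result = optimize.minimize(neg_ll, init_params)

print("inferred parameters:", result.x)  # should be close to the true generative parameters
print("true parameters:    ", params)
```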
