Question: Hello! I could use some help with this problem. (a) Suppose you're given n outcomes Y = ( Y 1 , . . . Y

Hello! I could use some help with this problem.

(a) Suppose you're given n outcomes Y = (Y1,...Yn), and each outcome i is associated with p real-valued covariates (Xi1 , ... , Xip); let X denote the n x p design matrix X, and assume it's full rank with n > p. You run an OLS regression and obtain coefficients:

^= (XTX)-1 XTY

Recall that the p-value pj associated to the j'th coefficient is hte p-value for a t-test of the null hypothesis that the j'th coefficeint is zero. Suppose the population model takes the form Y = f(X1, ... , Xp) + for some function f, and error term . Specify the assumptions which would ensure validity of the t-test in this setting.

(b) Suppose we're given n observations of independent and identically distributed pairs (Y1, X1), ... , (Yn, Xn), where the Yi are binary outcomes, and the Xi are scalar (i.e. real-valued) covariates. The population model is the logistic regression model:

log(1P(Y=1X)P(Y=1X)) =(by definition)= logit(P(Y=1X))=1+ X.

Using the data, you fit an estimated coefficient ^ using maximum likelihood on the preceding population model. Let SE denote the estimated standard error of ^. For a large sample size n, give an approximate 95% CI for P(Y = 1 | X), and explain your reasoning.

Thanks!

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!