(Simulating Wald and Likelihood Ratio Tests) In this section we will investigate the distributions of hypothesis tests for logistic regression For this exercise, we will use the following predictors sample size 150 set seed(420) x1 rnorm(n sample size) x2 rnorm(n sample size) x3 rnorm(n sample size) Recall that p(x) P Y 1X x p(x) P Y 1X x Consider the true model log(p(x) 1p(x) ) 0 1 x 1 log(p(x)1p(x)) 0 1x1 where 0 0 4 0 0 4 1 0 35 1 0 35 (a) To investigate the distributions, simulate from this model 2500 times To do so, calculate P Y 1X x P Y 1X x for an observation, and then make a random draw from a Bernoulli distribution with that success probability (Note that a Bernoulli distribution is a Binomial distribution with parametern 1 n 1 There is no direction function inRfor a Bernoulli distribution ) Each time, fit the model log(p(x) 1p(x) ) 0 1 x 1 2 x 2 3 x 3 log(p(x)1p(x)) 0 1x1 2x2 3x3 Store the test statistics for two tests The Wald test forH 0 2 0 H0 2 0, which we say follows a standard normal distribution for large samples The likelihood ratio test forH 0 2 3 0 H0 2 3 0, which we say follows a 2 2distribution (with some degrees of freedom) for large samples (b) Plot a histogram of the empirical values for the Wald test statistic Overlay the density of the true distribution assuming a large sample (c) Use the empirical results for the Wald test statistic to estimate the probability of observing a test statistic larger than 1 Also report this probability using the true distribution of the test statistic assuming a large sample (d) Plot a histogram of the empirical values for the likelihood ratio test statistic Overlay the density of the true distribution assuming a large sample (e) Use the empirical results for the likelihood ratio test statistic to estimate the probability of observing a test statistic larger than 5 Also report this probability using the true distribution of the test statistic assuming a large sample (f) Repeat (a) (e) but with simulation using a smaller sample size of 10 Based on these results, is this sample size large enough to use the standard normal and 2 2distributions in this situation Explain sample size 10 set seed(420) x1 rnorm(n sample size) x2 rnorm(n sample size) x3 rnorm(n sample size) looking for R coding help

The Answer is in the image, click to view ...

Question: (Simulating Wald and Likelihood Ratio Tests) In this section we will investigate the distributions of hypothesis tests for logistic regression. For this exercise, we will

(Simulating Wald and Likelihood Ratio Tests)

In this section we will investigate the distributions of hypothesis tests for logistic regression. For this exercise, we will use the following predictors.

sample_size = 150 set.seed(420) x1 = rnorm(n = sample_size) x2 = rnorm(n = sample_size) x3 = rnorm(n = sample_size)

Recall that

p(x)=P[Y=1X=x]

Consider the true model

log(p(x)

1p(x)

log(p(x)1p(x))=0+1x1

where

0
=0.4
0=0.4
1
=0.35
1=0.35

(a)To investigate the distributions, simulate from this model 2500 times. To do so, calculate

P[Y=1X=x]

for an observation, and then make a random draw from a Bernoulli distribution with that success probability. (Note that a Bernoulli distribution is a Binomial distribution with parametern=1

n=1. There is no direction function inRfor a Bernoulli distribution.)

Each time, fit the model:

log(p(x)

1p(x)

log(p(x)1p(x))=0+1x1+2x2+3x3

Store the test statistics for two tests:

The Wald test forH
0
:
2
=0
H0:2=0, which we say follows a standard normal distribution for "large" samples
The likelihood ratio test forH
0
:
2
=
3
=0
H0:2=3=0, which we say follows a
2
2distribution (with some degrees of freedom) for "large" samples

(b)Plot a histogram of the empirical values for the Wald test statistic. Overlay the density of the true distribution assuming a large sample.

(c)Use the empirical results for the Wald test statistic to estimate the probability of observing a test statistic larger than 1. Also report this probability using the true distribution of the test statistic assuming a large sample.

(d)Plot a histogram of the empirical values for the likelihood ratio test statistic. Overlay the density of the true distribution assuming a large sample.

(e)Use the empirical results for the likelihood ratio test statistic to estimate the probability of observing a test statistic larger than 5. Also report this probability using the true distribution of the test statistic assuming a large sample.

(f)Repeat(a)-(e)but with simulation using a smaller sample size of 10. Based on these results, is this sample size large enough to use the standard normal and

2distributions in this situation? Explain.

sample_size = 10 set.seed(420) x1 = rnorm(n = sample_size) x2 = rnorm(n = sample_size) x3 = rnorm(n = sample_size)

looking for R coding help.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!

COVER PAGE STAT 608 Homework 05 Summer 2017 Please TYPE your name and email address below, then convert to PDF and attach as the first page of your homework upload. NAME: EMAIL: HOMEWORK NUMBER:...

Possible Multiple Choice Questions for the Exam. Focus on the topics discussed in class. Chapter 1 Multiple Choice Identify the choice that best completes the statement or answers the question. ____...

PLEASE USE THE FOLLOWING ARTICLE TO ANSWER THE QUESTION: Wolf, L. A., Perhats, C., Delao, A., & Martinovich, Z. (2017). The effect of reported sleep, perceived fatigue, and sleepiness on cognitive...

include numbers from tables " Analysis of Maximum Likelihood Estimates and odd ratio estimate to the results and discussion " rephrase results below and discussion section INSTRUCTIONS : For this...

I need help with the article. I need to do 2 pages summary for the article ( CEO Turnover and Audit Pricing) I attatch . I attach the summary sample of article ( On the Timing of CEO Stock Option...

UNIVERSITY OF BRADFORD WORKSHOP 2 2016 Module PH3004D Learning Outcomes: By the end of this workshop you will be able to: 1) Use SPSS to check the normal distribution of the data assumption 2) Edit...

Assignment 8 Objective: To test examine the probability of a post-operative complication following an appendectomy. Dataset: Appendicitis.sas7bdat Package: SASStudio B. Particulars: 1. The dataset...

Hello, I have a question about logistic regression. Please help! In a survey study, the age of women, desire of having more children (1= yes and 0=No) and contraceptive use (1=yes and 0=No)...

I need to see the SPSS output. You need to have all z-scores, all charts, all descriptives data from SPSS, everything you used to answer the questions. I am sending you what the previous tutor sent...

INSTRUCTIONS : For this assignment you have to execute and interpret one regression model. The objective is to determine how much will be saved if 10% of complicated appendicitis cases were...

The three accounts shown below appear in the general ledger of Bennis Corp. during 2011. Instructions From the postings in the accounts, indicate how the information is reported on a statement of...

For this homework, students will design a networking solution for a fictional company (Wichita Boat, Inc). This will require that the infrastructure be upgraded and expanded in a manner that produces...

Which one of the following variables influences the value of call options? Level of interest rates Time to expiration of the option Dividend yield of underlying stock Stock price volatility Multiple...

You are planning to prepare for your child's college tuition. At the beginning of year 11 , you will have to pay $50,000 for the first-year tuition. The tuition is paid at the beginning of each year....