Question: In R, Python, or your favorite programming language, write a function that takes as arguments a matrix X and a column vector y and returns a list containing a column vector of OLS coefficient estimates, the standard estimate of the covariance matrix of the parameters, and an estimate of the variance. Then use the file on Canvas titled MonteCarloShell.R to run the following experiments (if you are using a language other than R, you will need to write your own shell):
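A minimal sketch of the requested function in Python, using only the standard library so that it runs without extra packages (NumPy would be the more usual choice). The helper names `transpose`, `matmul`, `invert` and `ols` are illustrative, and "an estimate of the variance" is assumed here to mean the error variance estimate RSS / (n - k).

```python
def transpose(a):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    bt = transpose(b)
    return [[sum(u * v for u, v in zip(row, col)) for col in bt] for row in a]


def invert(a):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    k = len(a)
    # Augment a with the identity matrix: [a | I].
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(k)]
           for i, row in enumerate(a)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("X'X is singular; X lacks full column rank")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(k):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    # The right-hand half now holds the inverse.
    return [row[k:] for row in aug]


def ols(X, y):
    """OLS for an n x k matrix X (list of rows) and an n x 1 column vector y.

    Returns a dict with the k x 1 coefficient vector, the k x k covariance
    estimate sigma2 * (X'X)^-1, and sigma2 = RSS / (n - k).
    """
    n, k = len(X), len(X[0])
    Xt = transpose(X)
    xtx_inv = invert(matmul(Xt, X))
    beta = matmul(xtx_inv, matmul(Xt, y))
    fitted = matmul(X, beta)
    rss = sum((yi[0] - fi[0]) ** 2 for yi, fi in zip(y, fitted))
    sigma2 = rss / (n - k)
    cov = [[sigma2 * v for v in row] for row in xtx_inv]
    return {"beta": beta, "cov": cov, "sigma2": sigma2}


# Small hand-checkable example: the fit is y = 0.7 + 2.2 x with sigma2 = 0.9
# (up to floating-point rounding).
result = ols([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]],
             [[1.0], [3.0], [4.0], [8.0]])
print(result["beta"], result["sigma2"])
```

Inverting X'X explicitly keeps the sketch close to the textbook formula; a production implementation would more likely solve the least-squares problem with a QR decomposition.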

1. In your Monte Carlo shell, in the section that computes the data for each trial, create a column vector of standard normal random numbers of length N (the value of N to be specified later). Call it e. Then create an Nx2 matrix X where the first column is all ones and the second column is made up of standard normal random variates. Next create a 2x1 matrix b containing the values 1 and 2. Finally, compute the column vector y as the sum of X times b and the column vector e, that is, y = Xb + e.
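The data-generating step for a single trial might look like the following sketch. The function name `simulate_trial` and the seeded `random.Random` generator are assumptions for illustration; they are not taken from MonteCarloShell.R.

```python
import random


def simulate_trial(n, rng):
    """Generate one trial's data: y = Xb + e with b = (1, 2)'."""
    # e: n x 1 column vector of standard normal errors.
    e = [[rng.gauss(0.0, 1.0)] for _ in range(n)]
    # X: n x 2 matrix, a column of ones and a standard normal regressor.
    X = [[1.0, rng.gauss(0.0, 1.0)] for _ in range(n)]
    # b: 2 x 1 matrix of true coefficients.
    b = [[1.0], [2.0]]
    # y = Xb + e, computed row by row.
    y = [[row[0] * b[0][0] + row[1] * b[1][0] + err[0]]
         for row, err in zip(X, e)]
    return X, y, e, b


rng = random.Random(0)  # fixed seed so the draw is reproducible
X, y, e, b = simulate_trial(3, rng)
```

Passing the generator in as an argument lets the surrounding shell reuse one seeded stream across all trials.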

a. Now set N to 3 and run 10,000 Monte Carlo trials. Recover each simulated estimate of the coefficient on the second column of the X matrix and each simulated estimate of the variance of the estimated coefficient. What is the mean of the simulated estimates of b[2]? Create a histogram of the simulated coefficient estimates with 101 cells in the range 0 to 4. Compare the variance of the 10,000 estimates to the average value of the estimated variance across the 10,000 trials. How do they compare? How accurate are the estimates of the coefficient? What fraction of the time are the estimates greater than zero?
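One way to organize this experiment is sketched below. Because X has only a constant and one regressor, the sketch uses the closed-form simple-regression formulas, which give the same slope and variance estimate as the general OLS function. The names `fit_slope`, `monte_carlo` and `histogram` are illustrative, and the histogram is returned as a list of 101 cell counts rather than plotted.

```python
import random
import statistics


def fit_slope(x, y):
    """Regress y on a constant and x; return (slope, estimated variance of slope)."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    # sigma2 = RSS / (n - 2); Var(slope) is estimated by sigma2 / Sxx.
    return slope, (rss / (n - 2)) / sxx


def monte_carlo(n, trials=10_000, seed=0):
    """Run the trials and collect slope estimates and their estimated variances."""
    rng = random.Random(seed)
    slopes, est_vars = [], []
    for _ in range(trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(n)]
        y = [1.0 + 2.0 * xi + rng.gauss(0.0, 1.0) for xi in x]
        slope, est_var = fit_slope(x, y)
        slopes.append(slope)
        est_vars.append(est_var)
    return slopes, est_vars


def histogram(values, lo=0.0, hi=4.0, cells=101):
    """Count values falling in each of `cells` equal-width bins on [lo, hi)."""
    counts = [0] * cells
    width = (hi - lo) / cells
    for v in values:
        if lo <= v < hi:
            counts[min(int((v - lo) / width), cells - 1)] += 1
    return counts


slopes, est_vars = monte_carlo(n=3)
counts = histogram(slopes)
print("mean of slope estimates:", statistics.mean(slopes))
print("variance of slope estimates:", statistics.variance(slopes))
print("mean estimated variance:", statistics.mean(est_vars))
print("fraction of estimates > 0:", sum(s > 0 for s in slopes) / len(slopes))
```

Calling `monte_carlo` with a different `n` reruns the same experiment at another sample size, although the larger sizes will be slow in pure Python.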

b. Do the same as in a. with N = 5, 10, 50, 100, 1,000 and 10,000. How does the graph change? How do the estimated variances compare to the actual variance of the estimates? How might you relate what you are seeing in the graphs to the concept of a probability limit (plim)?

c. Compute the two-tailed t-statistic for the hypothesis test b[2] = 2 for each of the 10,000 estimates of b[2] for the sample sizes of 5, 50 and 100. Find the .05 critical value of the t distribution for this model at each sample size. What fraction of the time do you reject the hypothesis that b[2] = 2? What would you expect to happen if you used a .01 critical value?
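A hedged sketch of the rejection-rate calculation follows. The model has two coefficients, so the t distribution has N - 2 degrees of freedom. The standard library has no t quantile function, so the critical values below are hardcoded from a standard t table (rounded to three decimals); with SciPy available, `scipy.stats.t.ppf(0.975, df)` would compute them. The names `T_CRIT_05`, `t_statistic`, `rejection_rate` and `run_all` are illustrative.

```python
import math
import random

# Two-tailed 5% critical values of the t distribution, keyed by sample size N,
# for df = N - 2 (df = 3, 48 and 98). Values taken from a t table.
T_CRIT_05 = {5: 3.182, 50: 2.011, 100: 1.984}


def t_statistic(x, y, null_value=2.0):
    """t-statistic for H0: slope = null_value in a regression of y on [1, x]."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    std_err = math.sqrt((rss / (n - 2)) / sxx)
    return (slope - null_value) / std_err


def rejection_rate(n, crit, trials=10_000, seed=0):
    """Fraction of trials in which |t| exceeds the critical value."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(n)]
        y = [1.0 + 2.0 * xi + rng.gauss(0.0, 1.0) for xi in x]
        if abs(t_statistic(x, y)) > crit:
            rejections += 1
    return rejections / trials


def run_all(trials=10_000):
    """Rejection rate for each sample size in T_CRIT_05."""
    return {n: rejection_rate(n, crit, trials, seed=n)
            for n, crit in T_CRIT_05.items()}
```

Calling `run_all()` returns a dictionary mapping each sample size to its rejection fraction. Swapping in the .01 critical values only requires a second table with the same keys.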

d. (extra credit) Use some other distribution of the errors besides the normal distribution and repeat c. for sample sizes 3, 10 and 50. How important is the assumption of normality of e for the t-test? Can you find a distribution of the errors that produces 20% more or fewer rejections than should happen with normal errors for a sample size of 15?
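For the extra-credit part, the error distribution can be made a pluggable argument, as in the sketch below. The centered exponential errors are only one hypothetical choice (mean zero, unit variance, skewed), and nothing here claims that they produce the 20% deviation the question asks about. Critical values are again hardcoded from a t table for df = N - 2, and all function names are illustrative.

```python
import math
import random

# Two-tailed 5% t critical values by sample size N (df = 1, 8, 13 and 48).
T_CRIT_05 = {3: 12.706, 10: 2.306, 15: 2.160, 50: 2.011}


def standard_normal(rng):
    """Baseline error draw."""
    return rng.gauss(0.0, 1.0)


def centered_exponential(rng):
    """Exponential(1) shifted to mean zero: skewed, with variance one."""
    return rng.expovariate(1.0) - 1.0


def t_statistic(x, y, null_value=2.0):
    """t-statistic for H0: slope = null_value in a regression of y on [1, x]."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    return (slope - null_value) / math.sqrt((rss / (n - 2)) / sxx)


def rejection_rate(n, draw_error, trials=10_000, seed=0):
    """Fraction of trials rejecting b[2] = 2 when errors come from draw_error."""
    rng = random.Random(seed)
    crit = T_CRIT_05[n]
    rejections = 0
    for _ in range(trials):
        # The regressor stays standard normal; only the errors change.
        x = [rng.gauss(0.0, 1.0) for _ in range(n)]
        y = [1.0 + 2.0 * xi + draw_error(rng) for xi in x]
        if abs(t_statistic(x, y)) > crit:
            rejections += 1
    return rejections / trials


for n in (3, 10, 50):
    print(n, rejection_rate(n, standard_normal, trials=2000),
          rejection_rate(n, centered_exponential, trials=2000))
```

The loop uses 2,000 trials to keep the pure-Python run short; the default of 10,000 matches the exercise. Any function that takes the generator and returns one draw can be passed as `draw_error`.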

