Question:

Problem 1

Consider the fat data from the faraway library in R. The following code is an example of how to select a random test set of 25 observations and to use the remaining observations as the training set. In the code, we set the random seed to make the result reproducible, but this seed can be changed.

    library(faraway)
    n = dim(fat)[1]
    set.seed(12357)
    testid = sample(n, 25, replace=FALSE)
    trainid = -testid
    test = fat[testid, ]
    train = fat[trainid, ]

We will compare several regression methods using train/test evaluation.

a) For the fat data, create a randomly selected test set of 25 observations and a training set consisting of all the other observations, removing the variables brozek and density from the data. Display the first 6 rows of the training and test sets. Also display the dimensions of the training data frame and test data frame.

Answer:

b) Use the training data to estimate the linear regression of siri on all of the other variables except for brozek and density. Then use the test data to compute the estimated mean square error for prediction.

Answer:

c) Repeat exercise b) for linear regression with variables selected using the BIC criterion (leaps and bounds or stepwise).

Answer:

d) Repeat exercise b) for scaled principal components regression, where you keep enough components to account for 90% of the variation in predictor variables.

Answer:

e) Repeat exercise b) for Lasso regression, where the amount of shrinkage is selected by 10-fold cross-validation.
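One possible way to approach parts a) through e) is sketched below in R. It is a minimal sketch under stated assumptions, not a definitive solution: it assumes the faraway, leaps, and glmnet packages are installed; it uses base R's prcomp for the scaled principal components step rather than a dedicated PCR package; and it reuses the seed 12357 before the cross-validation, which is a choice made here only for reproducibility. The names fat2, mse, fit_lm, fit_bic, fit_pcr, and cv are illustrative and do not come from the problem statement.

```r
library(faraway)  # fat data
library(leaps)    # regsubsets(): exhaustive (leaps and bounds) subset search
library(glmnet)   # cv.glmnet(): lasso with cross-validation

# a) Drop brozek and density, then split into test and training sets
fat2 <- fat[, !(names(fat) %in% c("brozek", "density"))]
n <- nrow(fat2)
set.seed(12357)
testid <- sample(n, 25, replace = FALSE)
test  <- fat2[testid, ]
train <- fat2[-testid, ]
print(head(train))
print(head(test))
print(dim(train))
print(dim(test))

# Helper (hypothetical name): test-set mean square prediction error
mse <- function(pred) mean((test$siri - pred)^2)

# b) Linear regression of siri on all remaining variables
fit_lm <- lm(siri ~ ., data = train)
mse_lm <- mse(predict(fit_lm, newdata = test))

# c) Subset selection by BIC, then refit the chosen model with lm()
p <- ncol(train) - 1
subsets <- regsubsets(siri ~ ., data = train, nvmax = p)
best <- which.min(summary(subsets)$bic)
vars <- names(coef(subsets, best))[-1]  # drop the intercept name
fit_bic <- lm(reformulate(vars, response = "siri"), data = train)
mse_bic <- mse(predict(fit_bic, newdata = test))

# d) Scaled principal components regression, keeping 90% of the variance
xnames <- setdiff(names(train), "siri")
pca <- prcomp(train[, xnames], center = TRUE, scale. = TRUE)
cumvar <- cumsum(pca$sdev^2) / sum(pca$sdev^2)
k <- which(cumvar >= 0.90)[1]
pc_train <- data.frame(siri = train$siri, pca$x[, 1:k, drop = FALSE])
fit_pcr <- lm(siri ~ ., data = pc_train)
# Test scores use the centering, scaling, and rotation learned on train
pc_test <- data.frame(predict(pca, newdata = test[, xnames])[, 1:k, drop = FALSE])
mse_pcr <- mse(predict(fit_pcr, newdata = pc_test))

# e) Lasso with the penalty chosen by 10-fold cross-validation
x_train <- as.matrix(train[, xnames])
x_test  <- as.matrix(test[, xnames])
set.seed(12357)  # assumption: fold assignment is random, so fix a seed
cv <- cv.glmnet(x_train, train$siri, alpha = 1, nfolds = 10)
mse_lasso <- mse(as.numeric(predict(cv, newx = x_test, s = "lambda.min")))

# Side-by-side comparison of the test mean square errors
print(round(c(lm = mse_lm, bic = mse_bic, pcr = mse_pcr, lasso = mse_lasso), 3))
```

In this sketch the principal components and the lasso penalty are both determined from the training data only, so the 25 test observations are used solely for evaluation. Using s = "lambda.1se" instead of "lambda.min" in part e) would give a more heavily shrunk model; which one the exercise intends is not stated.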

