Question: Stat 311 Homework 3 This assignment will use the plasma retinol data set (PlasmaRetinolData.csv) and focuses on simple linear regression. Be sure to check the

Stat 311 Homework 3 This assignment will use the plasma retinol data set (PlasmaRetinolData.csv) and focuses on simple linear regression. Be sure to check the data dictionary (PlasmaRetinolDataDescptionpdt) to know What the variables stand for and to get their corresponding units for axis labels and interpretations in the context of the problem. . Some basic EDA a) Read in the data and use the ggplot2 package to make histograms of BetaDiet, 1n (BetaDiet), BetaPlasma, and in (BetaPlasma). Make sure your histograms are on one gure, so they are easy to compare. Compare the log transformed distribution to the unlogged distribution for each variable. b) You should see the following warning, \"Warning: Removed 1 rows containing nonnite values (stat_bin).\" in the output on the R Console page after rendering the histograms. You are getting this error because there are one or more zeros in BetaDiet or BetaPlasma; 111(0) is undened. I have included the code in the template to pull out these observations. How many observations are impacted by applying a natural log transformation? c) Create a subset of data that removes any points with zero in x or y. Use this subset for the remaining problems
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
