Question: Question 2. (To be done using *R). For this question, you will have to work with Pima Indians Diabetes Database data set in R. named
Question 2. (To be done using *R"). For this question, you will have to work with Pima Indians Diabetes Database data set in R. named 'Pimalndians Diabetes" from "mibench' library. The complete data set can be seen by simply typing Pima IndiansDiabetes into the console, however, for the sake of this question, we will be working with subsets of this data frame. As part of your solutions for this question, you will have to print screen or save some output, so I suggest to submit the entirety of this question as a Word or LaTeX document (use \begin{verbatim) Your R output lend verbatim))separate to the hand- written document you scan and submit for the previous questions. a) The first step in this question, is to remove the columns that we will not be using in the analysis from the original Pimalndians Diabetes data set, including the 'diabetes variable which contains the outcome (positiveegative), so that all 768 observations in the sample can be thought as being from the same population, ie, assume we did not know they were separated into diabetes outcomes. To do this, we will create a new data frame, called sample.data which consists of all rows of the the Pimalndians Diabetes data frame but only the columns from 1 to 4, using the following code: > library (mibench) >data ("PimaIndiansDiabetes") > sample data
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
