Question: For the datasets below, assume data was obtained via random sampling and that the population standard deviation is unknown. For the given variables of interest,

  1. For the datasets below, assume data was obtained via random sampling and that the population standard deviation is unknown. For the given variables of interest,
  2. Use R to create a normal probability plot and a histogram. Paste your work into your report
  3. Describe your findings. Do the graphs provide evidence against the assumption of a normally distributed population?
  4. Based on your descriptions (and the sample size n), determine if one mean inference is valid, and whether z-procedures or t-procedures should be employed.
  5. The dataset "HIGHLANDS101" contains informationfrom a sample of 101 properties in the Highlands neighbourhood of Edmonton. The variable of interest is Assessed Value (the value Edmonton assessed the dwelling at for property taxes). (6 marks)
  6. The dataset "AQIEDMONTONMAY2023" contains a sample of 10 AQI values in Edmonton in the first 2 weeks of May 2023 at the start of forest fire season (see Part A for more details about AQI values). The variable of interest is AQI. (6 marks)
  7. The dataset HEARTFAILUREPREDICTIONMALES21 is a sample of 21 males that looks at several variables that play a role in heart failure prediction in the United States. The variable of interest is MaxHr (maximum heart rate achieved on exercising in bpm). (6 marks)
  8. In the above work, we did not examine the boxplots for the considered samples. What can a boxplot show us that cannot be seen in a normal probability plot or histogram? (2 marks)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!