Question 2 pertains to two-mean inference based on independent samples. In practice, there are two possible...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Question 2 pertains to two-mean inference based on independent samples. In practice, there are two possible versions of this test: in one case, the "pooled" approach is used (if the sample variances are "close" in value, we assume equal population variances); in the other case the "unpooled" approach is used (if sample variances are not "close" in value, we do not assume equal population variances). Some instructors teach the pooled approach as the assumption of equal variances is necessary in other tests that look at comparing the means of several populations and, when the test is valid, the pooled approach is slightly more powerful. However, other instructors choose to not teach the pooled approach since the increase in statistical power is generally quite small, and it requires an additional assumption that should only be made that if we are very familiar with the background population. Your lecture instructor will let you know what approach you should use in the lecture. In the lab, we will solve all two independent sample problems using the non-pooled approach (i.e. without the assumption of equal variances). R allows for either approach; the default is "unpooled". Note: For two mean problems, R specifies the differences in means in alphabetical order. 2. Use R to perform the requested inference. (a) The dataset Insurance.xlxs contains information from random sample of 1338 policy holders of an insurance company. The variables of interest are "bmi" (the body mass index of policy holders) and "sex" (the reported gender of policy holders). You may assume the males and females are independent samples. Use the proper analytical tools in R to determine if there is significant evidence that the average BMI of females differs from that of males. Use the 10% significance level. Include all steps of your hypothesis test, and make sure to justify your assumptions. (7 marks) (b) Use the proper analytical tools in R with the dataset Insurance.xlsx to obtain a 90% confidence interval for the difference in average BMI of female and male policyholders. Does the interval provide evidence to indicate that the average BMI differs between females and males? (6 marks) (c) Interpret the confidence interval obtained in part (b). Does this interval support the conclusion of the hypothesis test in part (a)? Justify your answer. (4 marks: 2+2) (d) The dataset SummerStudents.xlsx contains information from a random sample of 44 students who took Statistics at MacEwan in the summer term. Your columns (variable) of interest are MUSICSTUDY (a column that records whether students listen to music while studying for an exam (no, yes)) and YAGE (a column that records student age). For education purposes assume that the two groups are two independent samples from a much larger normal population of statistics students and that the two populations have unequal variances. Use R to determine if, for a significance level of 5%, there is significant evidence that the mean age of students in the yes group is lower than the mean age of students in the no group. Choose the most correct (closest) answer. HINT. Be careful here. Recall that when doing a 2 independent samples t problem, R will calculate the numerator of the test statistic by subtracting the sample mean of the yeses from the sample mean of the nos. (7 marks) Question 2 pertains to two-mean inference based on independent samples. In practice, there are two possible versions of this test: in one case, the "pooled" approach is used (if the sample variances are "close" in value, we assume equal population variances); in the other case the "unpooled" approach is used (if sample variances are not "close" in value, we do not assume equal population variances). Some instructors teach the pooled approach as the assumption of equal variances is necessary in other tests that look at comparing the means of several populations and, when the test is valid, the pooled approach is slightly more powerful. However, other instructors choose to not teach the pooled approach since the increase in statistical power is generally quite small, and it requires an additional assumption that should only be made that if we are very familiar with the background population. Your lecture instructor will let you know what approach you should use in the lecture. In the lab, we will solve all two independent sample problems using the non-pooled approach (i.e. without the assumption of equal variances). R allows for either approach; the default is "unpooled". Note: For two mean problems, R specifies the differences in means in alphabetical order. 2. Use R to perform the requested inference. (a) The dataset Insurance.xlxs contains information from random sample of 1338 policy holders of an insurance company. The variables of interest are "bmi" (the body mass index of policy holders) and "sex" (the reported gender of policy holders). You may assume the males and females are independent samples. Use the proper analytical tools in R to determine if there is significant evidence that the average BMI of females differs from that of males. Use the 10% significance level. Include all steps of your hypothesis test, and make sure to justify your assumptions. (7 marks) (b) Use the proper analytical tools in R with the dataset Insurance.xlsx to obtain a 90% confidence interval for the difference in average BMI of female and male policyholders. Does the interval provide evidence to indicate that the average BMI differs between females and males? (6 marks) (c) Interpret the confidence interval obtained in part (b). Does this interval support the conclusion of the hypothesis test in part (a)? Justify your answer. (4 marks: 2+2) (d) The dataset SummerStudents.xlsx contains information from a random sample of 44 students who took Statistics at MacEwan in the summer term. Your columns (variable) of interest are MUSICSTUDY (a column that records whether students listen to music while studying for an exam (no, yes)) and YAGE (a column that records student age). For education purposes assume that the two groups are two independent samples from a much larger normal population of statistics students and that the two populations have unequal variances. Use R to determine if, for a significance level of 5%, there is significant evidence that the mean age of students in the yes group is lower than the mean age of students in the no group. Choose the most correct (closest) answer. HINT. Be careful here. Recall that when doing a 2 independent samples t problem, R will calculate the numerator of the test statistic by subtracting the sample mean of the yeses from the sample mean of the nos. (7 marks)
Expert Answer:
Posted Date:
Students also viewed these mathematics questions
-
A population of Ecuadorians have short stature. They have a rare genetic defect that affects the body's response to growth horm one, called Laron syndrome. Recent genetic studies have dem onstrated...
-
Refer to E8.11 and E8.13. Make the journal entries assuming Hegge uses the perpetual inventory system and the net price method.
-
What are the major species present in 0.015 M solutions of each of the following bases? a. KOH b. Ba(OH) 2 What is [OH - ] and the pH of each of these solutions?
-
The following note transactions occurred during the year for Zale Company: Nov. 25 Zale issued a 120 -day, 12 percent note payable for \(\$ 9,000\) to Porter Company for merchandise. Dec. 10 Zale's...
-
Ruddy is a retail company specializing in mens hats. Its budget director prepared the list of expected operating expenses that follows. All items are paid when incurred except sales commissions and...
-
Suppose your firm is considering investing in a project with the cash flows shown below, that the required rate of return on projects of this risk class is 11 percent, and that the maximum allowable...
-
A researcher wants to test whether displaying a new slogan ( the independent variable ) will increase sign - ups on a website newsletter ( the dependent variable ) . The researcher gathers a sample...
-
Develop a WBS for an organisation with 12 employees about to undergo an office move to a greenfield site. Be sure to identify the deliverables and the organisational units (people) responsible for...
-
Was the last Olympic and Paralympic Games a success? Carry out a search of the news of the time and the relevant websites to identify the characteristics of the project management that led to the...
-
Tesla. Elon Musk. Youve probably heard of both. Tesla was founded in 2003 by a group of engineers who wanted to prove that buyers didnt need to compromise looks and performance to drive electricthat...
-
Now review the AIPM site (www.aipm.com.au). (a) Review general information about the AIPM, as well as its membership information. (b) Review the benefits of being a member of the AIPM
-
For the similarity solution, what are the boundary conditions for the constant-wall-flux case? Show that a complete similarity does not exist for this case. Also show the condition for the case where...
-
The city of Toledo is considering adding additional storm water storage facilities so that it will never have sewage overflows. Economists have monetized the following effects over the expected...
-
The Ferris wheel in the figure has a radius of 68 feet. The clearance between the wheel and the ground is 14 feet. The rectangular coordinate system shown has its origin on the ground directly below...
-
The amounts of time (in months) that 16 stars of The Bachelorette and their chosen partner stayed together as of April 2021 are listed. 208 12 3 12 18 94 3 75 8 36 75 20 24 045
-
The six top-earning states in 2019 by median household income are listed. 1. Maryland 2. New Jersey 3. Hawaii 4. Massachusetts 5. Connecticut 6. Alaska
-
The ages of the Supreme Court justices as of April 7, 2021 In Exercises 25 and 26, find the range, mean, variance, and standard deviation of the population data set. 66 56 72 49 82 71 66 60 53
Study smarter with the SolutionInn App