# Question: Are students who participate in sports more extraverted than those

Are students who participate in sports more extraverted than those who do not? A random sample of students at a small university were asked to indicate whether they participated in sports (yes or no) and to rate their level of extraversion. Extraverts are outgoing, are talkative, and don't mind being the center of attention. Students were asked whether they agreed with the statement that they were extraverts, using a scale of 1-5 with 1 meaning "strongly disagree" and 5 meaning "strongly agree." There were 51 students who participated in sports and 64 who did not.

The mean extraversion score for those who participated in sports was 3.618, and the mean for those who did not participate was 3.172, a difference of 0.446 point. To determine whether the mean difference was significant, we performed a randomization test to test whether the mean extraversion level was greater for athletic students.

a. The histogram shows the results of 1000 randomizations of the data.

In each randomization, we found the mean difference between two groups that were randomly selected from the combined group of data: the combined data for the sporty and nonsporty. Note that, just as you would expect under the null hypothesis, the distribution is centered at about 0. The red line shows the observed sample mean difference in extraversion for the sporty minus the nonsporty. From the graph, does it look like the observed mean difference is unusual for this data set? Explain.

b. The software output gives us the probability of having an observed difference of 0.446 or more. (See the column labeled "Proportion = 7 Observed"). In other words, it gives us the right tail area, which is the p-value for a one-sided alternative that the mean extraversion score is higher for athletes than for nonathletes. State the p-value.

c. Using a significance level of 0.05, can we reject the null hypothesis that the means are equal?

d. If you did not have the computer output, explain how you would use the histogram to get an approximate p-value.

The mean extraversion score for those who participated in sports was 3.618, and the mean for those who did not participate was 3.172, a difference of 0.446 point. To determine whether the mean difference was significant, we performed a randomization test to test whether the mean extraversion level was greater for athletic students.

a. The histogram shows the results of 1000 randomizations of the data.

In each randomization, we found the mean difference between two groups that were randomly selected from the combined group of data: the combined data for the sporty and nonsporty. Note that, just as you would expect under the null hypothesis, the distribution is centered at about 0. The red line shows the observed sample mean difference in extraversion for the sporty minus the nonsporty. From the graph, does it look like the observed mean difference is unusual for this data set? Explain.

b. The software output gives us the probability of having an observed difference of 0.446 or more. (See the column labeled "Proportion = 7 Observed"). In other words, it gives us the right tail area, which is the p-value for a one-sided alternative that the mean extraversion score is higher for athletes than for nonathletes. State the p-value.

c. Using a significance level of 0.05, can we reject the null hypothesis that the means are equal?

d. If you did not have the computer output, explain how you would use the histogram to get an approximate p-value.

## Relevant Questions

Are women happier than men? A StatCrunch survey asked respondents to select a number from 1 (lowest) to 100 (highest) to measure their level of happiness. The sample mean for the 297 females was 71.15, and the sample mean ...McDonald's claims that its ice cream cones typically weigh 3.18 ounces (converted from grams). Here are the weights, in ounces, of cones purchased on different days from different servers: 4.2, 3.6, 3.9, 3.4, and 3.3 Carry ...Determine whether the number of texts sent by males and the number received by males are significantly different at the 0.05 level using a paired t-test. Figure A shows a scatterplot for the U.S. population (in millions) from 1850 to 1900. Figure B shows a residual plot of the same data. Give the reasons why the linear model is not appropriate for predicting the population in ...In Exercise 14.16 you examined the association between baseball players' ages and weights. Would a 95% confidence interval for the slope, based on the same data, include 0? Explain.Post your question