# Question: Some data sets contain outliers observations that fall well outside

Some data sets contain outliers, observations that fall well outside the overall pattern of the data. (We discuss outliers in more detail in Section 3.3.) Suppose, for instance, that you are interested in the ability of high school algebra students to compute square roots. You decide to give a square-root exam to 10 of these students. Unfortunately, one of the students had a fight with his girlfriend and cannot concentrate-he gets a 0. The 10 scores are displayed in increasing order in the following table. The score of 0 is an outlier.

0 58 61 63 67 69 70 71 78 80

Statisticians have a systematic method for avoiding extreme observations and outliers when they calculate means. They compute trimmed means, in which high and low observations are deleted or "trimmed off" before the mean is calculated. For instance, to compute the 10% trimmed mean of the test-score data, we first delete both the bottom 10% and the top 10% of the ordered data, that is, 0 and 80. Then we calculate the mean of the remaining data. Thus the 10% trimmed mean of the test-score data is

58 + 61 + 63 + 67 + 69 + 70 + 71 + 78 / 8 = 67.1.

The following table displays a set of scores for a 40-question algebra final exam.

a. Do any of the scores look like outliers?

b. Compute the usual mean of the data.

c. Compute the 5% trimmed mean of the data.

d. Compute the 10% trimmed mean of the data.

e. Compare the means you obtained in parts (b)–(d). Which of the three means provides the best measure of center for the data?

0 58 61 63 67 69 70 71 78 80

Statisticians have a systematic method for avoiding extreme observations and outliers when they calculate means. They compute trimmed means, in which high and low observations are deleted or "trimmed off" before the mean is calculated. For instance, to compute the 10% trimmed mean of the test-score data, we first delete both the bottom 10% and the top 10% of the ordered data, that is, 0 and 80. Then we calculate the mean of the remaining data. Thus the 10% trimmed mean of the test-score data is

58 + 61 + 63 + 67 + 69 + 70 + 71 + 78 / 8 = 67.1.

The following table displays a set of scores for a 40-question algebra final exam.

a. Do any of the scores look like outliers?

b. Compute the usual mean of the data.

c. Compute the 5% trimmed mean of the data.

d. Compute the 10% trimmed mean of the data.

e. Compare the means you obtained in parts (b)–(d). Which of the three means provides the best measure of center for the data?

## Answer to relevant Questions

Explain the difference between the quantities (Σxi)2 and Σx2i. Construct an example to show that, in general, those two quantities are unequal. Consider the data set 1, 2, 3, 4, 5, 6, 7, 8, 9. a. Use the defining formula to obtain the sample standard deviation. b. Replace the 9 in the data set by 99, and again use the defining formula to compute the sample standard ...2, 5, 0, – 1 We have provided simple data sets for you to practice the basics of finding measures of variation. For each data set, determine the a. Range. b. Sample standard deviation. Active Management of Labor (AML) was introduced in the 1960s to reduce the amount of time a woman spends in labor during the birth process. R. Rogers et al. conducted a study to determine whether AML also translates into a ...The U.S. Federal Bureau of Investigation publishes the annual crime rates for each state and the District of Columbia in the document Crime in the United States. Those rates, given per 1000 population, are provided on the ...Post your question