Question: MATH 250- Elements of Statistics Class Data, Spring 2025---FIRST EXAMINATION of Student Data Individual ID# Gender Foot Length Height Age Armspan Number in Family Hair

MATH 250- Elements of Statistics
Class Data, Spring 2025---FIRST EXAMINATION of Student Data
Individual ID#GenderFoot LengthHeightAgeArmspanNumber in FamilyHair Color
1Female19.0154.58133.05Brown
2Male29.0189.051198.04Brown
3Male30.5191.023185.04Brown
4Male49.0192.064194.03Grey
5Female24.0162.528161.510Brown
6Female25.0172.043165.03Blonde
7Female24.0169.533163.510Blonde
8Female21.5157.523155.03Blonde
9Female19.0167.519162.53Brown
10Male23.5175.524175.08Blonde
11Male27.0172.519170.03Brown
12Female20.0157.020161.03Brown
13Male24.0172.542172.03Brown
14Male28.0182.028183.06Brown
15Female22.0160.021140.07Brown
16Male30.0188.033185.58Blonde
17Female25.5172.521172.51Blonde
18Female24.0170.019172.04Black
19Female26.0168.029170.02Black
20Male23.0170.026176.03Brown
21Female26.0168.029170.02Black
22Male26.5190.020196.011Black
23Female23.5152.534144.53Brown
24Female26.0155.035156.06Brown
25Female24.5157.519155.04Brown
26Female28.0173.019186.03Brown
27Male27.5180.033177.03Brown
28Female26.0175.520160.04Brown
29Female24.0170.031168.04Brown
30Male28.0183.030180.54Brown
31Male27.0183.030182.514Blonde
32Female24.5174.019165.03Brown
33Female24.5163.521168.55Blonde
34Female22.0170.029170.04Blonde
35Female23.0160.019165.02Black
36Female25.0155.028153.05Red
37Female23.0152.046152.07Blonde
38Female24.5166.044165.011Brown
39Male25.5183.026183.010Brown
40Male25.5183.521179.54Black
41Female23.0160.026167.08Brown
42Male25.5183.031182.07Brown
43Female20.0155.020154.05Black
44Male27.5179.029150.03Brown
45Female26.5144.524146.01Brown

46

Female23.0160.037159.03Brown
47Male27.5187.038185.02Brown
48Female24.0168.033164.09Brown
49Male27.0188.036185.02Brown
50Female27.0169.031161.54Brown
51Female23.0160.019157.04Brown
52Female24.0159.032161.54Brown
53Female25.0165.018153.54Brown
54Female23.5165.530166.510Black
55Male27.0190.021189.05Brown
56Female23.0167.035154.02Brown
57Male20.5168.544171.06Brown
58Female24.0167.552162.55Brown
59Female26.0175.520164.55Blonde
60Female22.5162.519162.53Black
61Male25.5183.025184.05Brown

General Instructions: Please place your name above, then complete the following questions. NOTE: Read the entire document below to get a feel for the activity before continuing. Make sure to save this Excel file using the filename "yournameActivity5.xlsx". Once complete, submit your answers to this activity by attaching your Excel file to the Speadsheet Activity 5 - Probability Distributions assignment in Blackboard. Use the area to the near right in this Excel worksheet when calculating all values/statistics/parameters. Methods/work to calculate values must be shown in this spreadsheet tab in order to receive full credit. (Work for part 1. should be shown in the tab labelled "Original Data Set for Analysis")

overview:

This activity has three major purposes. First, it is designed to show the importance of examining the data prior to performing statistical calculations. Second, the activity should help you recognize the difference between a discrete random variable and a continuous random variable. Finally, the activity is designed to help you see how the descriptive statistical analysis differ for both types of data. The data to be used in answering the questions below comes from the data collected in the first spreadsheet activity. The sample data set collected from the students of this course originally had a size of n = 137. However, to make the set a bit more manageable for beginning statistics, a collection of 61 individuals' data was randomly selected. You may recognize your own data within this set if you were one of the randomly selected individuals. This data is supplied in the attached worksheet titled "Original Data Set for Analysis"...see the tab at the bottom of this document window.

1. The first step with analyzing data is to make sure that all data values were sbumitted correctly and seem to be reasonable/proper measurementsthis is called cleaning the data. In initial analysis of the student data by instructors, there were several mismeasurments or incorrectly given measurements; much of this was cleaned already. More formally one would also look for possible outliers using a process (like the 1.5IQR rule) and decide whether or not to include these data in further analysis. In general, a valid and well established argument should always be given for removal of any data from a data set; removal of any collected data should NOT be done arbitrarily or to skew the data to some desired viewpoint. Analyze the data given in the ATTACHED worksheet (again, see this worksheet below as "Original Data Set for Analysis"). Using the 1.5IQR rule discussed in the first unit, establish there is exactly one outlier within the foot length variable. For our course, examine only the foot length variable for outliers! Once you demonstrate that an outlier exists, give the individual's ID# and data as your answer below to this question #1. FINALLY, copy the data set, excluding the outlier, to the designated region at the right (several columns over to the right on this worksheet). Do not delete the outlier's data on the "Original Data Set for Analysis" tab. We want to keep a record of all the data, but this individual's (the outlier's) data will NOT be used in any other calculations performed in answering questions #2 and #3 below in this activity. NOTE: You should be left with 60 rows of data in the region to the right when finished with this problem, even though the ID# will start with 1 and end at 61.

2. For this problem, we will first assume the data is population data (only these 60). Now we focus only on the Number in Family variable in the data set you copied to the right...in which you deleted the entire row chosen in answering #1 (again your data table should contain 60 individuals' data.) Define the random variable X to be "Number in Family" and complete the following for these sixty data values:

a.) What makes X a discrete random variable and not a continuous one?

b.) In the area to the right, create a probability distribution table showing the possible values of X, the frequency of each value, and the associated relative frequency values P(X) as determined by the collected data.

c.) Determine the expected value (mean) of the random variable X using your probability distribution table created in part b. directly above. (Hint: the requirement is to use only the information in the table you produced in part b., not to use the raw data---review how to produce the mean from a probability distribution table via the text or the Excel Guides for Unit 2.)

MEAN :__________

d.) Determine the standard deviation of the random variable X, again using only the values within your probability distribution table. (You can check your answers by finding the population s.d. of the data on family size, BUT this problem needs to be answered through use of only the probability distribution table built in part b--again see the text and Excel Guides.)

St. Deviation :___________

e.) Frequently, any data outside of the Two- Sigma Rule interval is considered "unusual." Decide if any of the included values of the random variable X are unusual. Give a concluding statement below in regard to your decision. Show work for 2 Sigma Rule to the right of the problem.

f.) Determine the probability that the random variable X is within one standard deviation of the mean. Show work for calculating the lower and upper bounds of the event in question and the probability to the right of the problem. Express your answer in a complete sentence.

3. Now consider your cleaned data in reference to the Footlength variable/factor of the student data. Notice that this variable is categorized as quantitative, continuous, and ratio level in type. (This portion of the activity is Based on Workshop Statistics, Rossman, p. 66)

a.) In the area to the right, copy the Footlength data values (again take the cleaned data set of 60 values) and then sort them in order from least to greatest. From this column of footlength values, create an appropriate frequency table with exactly six classes--remember, we did such frequency charts back in Unit 1. For the next part, b, you may choose to produce the histogram graph at the same time. Finally extend your frequency table to include a relative frequency (probability) column.

b.) Produce a histogram for your frequency table (if you did not do so as you constructed your table in part a). Describe the distribution. Does the distribution of the footlength data appear to be a roughly normal distribution? Explain your answer briefly.

c.) Compute the mean and standard deviation of the footlength data values...not from the frequency table as done in the discrete case above but as done in the first unit, but this time assuming this is SAMPLE DATA of all stats students.

Sample mean, xbar ( x ):_________

Sample std. deviation, s: __________

d.) Determine the proportion of the students in this sample whose footlength is at least 25.5 cm.

e.) Suppose that the footlengths in the population of all university students taking elementary statistics do in fact follow a perfect normal distribution (though our sample group did not) with the population mean = 24.5 cm and population standard deviation = 2.2 cm . Under this assumption, determine the proportion of all students who have footlength measure greater than 25.5 cm.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock