Question: using pandas Task 4: Bootstraping from the sample (22pt) Now let's forget that you ever had the data from the whole population. Say, you only

using pandas

using pandas Task 4: Bootstraping from the sample (22pt) Now let's forget

Task 4: Bootstraping from the sample (22pt) Now let's forget that you ever had the data from the whole population. Say, you only have your initial sample of 25 participants. How would you get an estimate of the confidence interval of the mean of the sample? Question 1 (8pt) Load the sample stored in run 10sample.csv. Now call the resample function with that time data, in each loop resampling 25 data points from 25 data points (with replacement!). You have now taken 1000 bootstrap samples. Report the mean of the bootstrap samples, the standard deviation of the the samples, and the 95% confidence interval. In [ ]: Question 2 (pt) Plot of the histogram of your bootstrap samples -plot the lower and upper bound of the confidence interval as a vertical line - see plt.axvline (see homework 3). Plot the true population mean (Task 2.1) as a red vertical line. Make sure that the graph has x- and y-labels. In [] : Question 3: (6pt) Written answer: Does the 95% confidence interval include the true population mean? Does it include the mean value from last year's cherry blossom run (101min)? Is there statistical evidence from your sample of N=25, that the race times have gotten faster from last year? Task 4: Bootstraping from the sample (22pt) Now let's forget that you ever had the data from the whole population. Say, you only have your initial sample of 25 participants. How would you get an estimate of the confidence interval of the mean of the sample? Question 1 (8pt) Load the sample stored in run 10sample.csv. Now call the resample function with that time data, in each loop resampling 25 data points from 25 data points (with replacement!). You have now taken 1000 bootstrap samples. Report the mean of the bootstrap samples, the standard deviation of the the samples, and the 95% confidence interval. In [ ]: Question 2 (pt) Plot of the histogram of your bootstrap samples -plot the lower and upper bound of the confidence interval as a vertical line - see plt.axvline (see homework 3). Plot the true population mean (Task 2.1) as a red vertical line. Make sure that the graph has x- and y-labels. In [] : Question 3: (6pt) Written answer: Does the 95% confidence interval include the true population mean? Does it include the mean value from last year's cherry blossom run (101min)? Is there statistical evidence from your sample of N=25, that the race times have gotten faster from last year

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!