Statistical Analysis
Individual Assignment 1 (To be submitted through the Dropbox)
Value: 25 Points

Compile your answers in one Microsoft Word document. Copy and paste all figures from Excel. No late submissions will be allowed.

Question 1 (5 Points)
You are provided with data for several nations from the Human Development Report, 2003.
a. Compute summary statistics (mean, median, mode, standard deviation) for the variables GDP (GDP per capita in thousands of dollars) and Internet use. Describe the distributions based on the summary statistics. Address skewness in your description.
b. Generate a scatterplot with clearly labeled axes for Internet use and GDP. Internet use will be on the Y axis.
c. Compute the correlation between Internet use and GDP and interpret it. Refer to both the strength and direction of the correlation in your interpretation.
d. Also interpret the correlation as a coefficient of determination (r-squared).

Question 2 (5 Points)
You are given data about the per capita personal income and infant mortality rate for a number of Tennessee counties during 2010.
a. Generate a scatterplot of the two variables. Describe the relationship depicted on the scatterplot.
b. Compute Pearson's correlation coefficient between the two variables.
c. Provide an interpretation of the correlation obtained. Refer to both the strength and direction of the correlation in your interpretation.
d. Also interpret the correlation in terms of r-squared (coefficient of determination).

Question 3 (5 Points)
Using the data provided from the US Statewide Crime data set, conduct a regression to predict murder rate from unemployment. Show the results of the regression.
a. What is the value of the intercept a?
b. What is the value of the slope b?
c. What is the predicted murder rate for a state with an unemployment rate of 5.2?
d. Interpret the value of the R-squared for the regression model.

Question 4 (10 Points)
Using data provided from the US Statewide Crime data set:
a. Construct a scatterplot using Excel or any other software (SPSS or Minitab) between the variables "murder rate" and "poverty." Provide an appropriate title for the chart as well as labels for both axes.
b. There seems to be a problem caused by the presence of an outlier. Identify the outlier and delete it (simply erase its value, do not replace it with zero). Show the new (second) scatterplot. Describe the pattern that emerges. What might this relationship imply?
c. Compute the correlation coefficient between the two variables and interpret this correlation. Refer to both the strength and direction of the correlation in your interpretation. Also interpret the correlation in terms of r-squared (coefficient of determination).
d. Conduct a regression to predict murder rate from poverty (when the observation for DC is removed from the data set) and interpret the results.
e. What is the predicted value for a state with a poverty rate of 13.4?
f. Interpret the value of the R-squared for the regression model.

Data for Question 1: nations, Human Development Report, 2003 (GDP is per capita, in thousands of dollars)

Country         INTERNET  GDP       CO2       CELLULAR  FERTILITY LITERACY
Algeria         0.65      6.09      3         0.3       2.8       58.3
Argentina       10.08     11.32     3.8       19.3      2.4       96.9
Australia       37.14     25.37     18.2      57.4      1.7       100
Austria         38.7      26.73     7.6       81.7      1.3       100
Belgium         31.04     25.52     10.2      74.7      1.7       100
Brazil          4.66      7.36      1.8       16.7      2.2       87.2
Canada          46.66     27.13     14.4      36.2      1.5       100
Chile           20.14     9.19      4.2       34.2      2.4       95.7
China           2.57      4.02      2.3       11        1.8       78.7
Denmark         42.95     29        9.3       74        1.8       100
Egypt           0.93      3.52      2         4.3       3.3       44.8
Finland         43.03     24.43     11.3      80.4      1.7       100
France          26.38     23.99     6.1       60.5      1.9       100
Germany         37.36     25.35     9.7       68.2      1.4       100
Greece          13.21     17.44     8.2       75.1      1.3       96.1
India           0.68      2.84      1.1       0.6       3         46.4
Iran            1.56      6         4.8       3.2       2.3       70.2
Ireland         23.31     32.41     10.8      77.4      1.9       100
Israel          27.66     19.79     10        90.7      2.7       93.1
Japan           38.42     25.13     9.1       58.8      1.3       100
Malaysia        27.31     8.75      5.4       31.4      2.9       84
Mexico          3.62      8.43      3.9       21.7      2.5       89.5
Netherlands     49.05     27.19     8.5       76.7      1.7       100
New Zealand     46.12     19.16     8.1       59.9      2         100
Nigeria         0.1       0.85      0.3       0.3       5.4       57.7
Norway          46.38     29.62     8.7       81.5      1.8       100
Pakistan        0.34      1.89      0.7       0.6       5.1       28.8
Philippines     2.56      3.84      1         15        3.2       95
Russia          2.93      7.1       9.8       5.3       1.1       99.4
Saudi Arabia    1.34      13.33     11.7      11.3      4.5       68.2
South Africa    6.49      11.29     7.9       24.2      2.6       85
Spain           18.27     20.15     6.8       73.4      1.2       96.9
Sweden          51.63     24.18     5.3       79        1.6       100
Switzerland     30.7      28.1      5.7       72.8      1.4       100
Turkey          6.04      5.89      3.1       29.5      2.4       77.2
United Kingdom  32.96     24.16     9.2       77        1.6       100
United States   50.15     34.32     19.7      45.1      2.1       100
Vietnam         1.24      2.07      0.6       1.5       2.3       90.9
Yemen           0.09      0.79      1.1       0.8       7         26.9

Excel descriptive statistics for INTERNET and GDP

Statistic           INTERNET      GDP
Mean                21.13974359   15.99333333
Standard Error      2.9571042     1.696898987
Median              20.14         17.44
Mode                #N/A          #N/A
Standard Deviation  18.46710981   10.59713078
Sample Variance     341.0341447   112.2991807
Kurtosis            -1.55535288   -1.55523261
Skewness            0.235082735   0.019247525
Range               51.54         33.53
Minimum             0.09          0.79
Maximum             51.63         34.32
Sum                 824.45        623.74
Count               39            39

Data for Question 2: Tennessee counties, 2010

County      Per capita personal income (2010)  Infant mortality rate (2010)
Anderson    $34,358                            7.2
Bedford     $29,667                            7.8
Benton      $27,129                            19.4
Bledsoe     $23,666                            15.2
Blount      $29,365                            6.7
Bradley     $30,030                            1.8
Campbell    $27,236                            0
Cannon      $29,927                            7.1
Carroll     $29,227                            3.1
Carter      $27,108                            12.3
Cheatham    $30,950                            12.2
Chester     $26,679                            10.9
Claiborne   $26,810                            6.1
Clay        $25,449                            0
Cocke       $24,742                            2.8
Coffee      $31,913                            9.1
Crockett    $29,336                            17.1
Cumberland  $27,920                            13.8
Davidson    $45,913                            7.6
Decatur     $31,265                            9.5
DeKalb      $29,971                            4.3
Dickson     $29,655                            12.1
Dyer        $31,136                            10.7
Fayette     $41,652                            13.2
Fentress    $27,347                            11
Franklin    $28,169                            2.6

Data for Questions 3 and 4: US Statewide Crime data set

State                 Violent crime  Murder rate  Poverty  High school  College  Single parent  Unemployed  Metropolitan
Alabama               486            7.4          14.7     77.5         20.4     26             4.6         70.2
Alaska                567            4.3          8.4      90.4         28.1     23.2           6.6         41.6
Arizona               532            7            13.5     85.1         24.6     23.5           3.9         87.9
Arkansas              445            6.3          15.8     81.7         18.4     24.7           4.4         49
California            622            6.1          14       81.2         27.5     21.8           4.9         96.7
Colorado              334            3.1          8.5      89.7         34.6     20.8           2.7         84
Connecticut           325            2.9          7.7      88.2         31.6     22.9           2.3         95.6
Delaware              684            3.2          9.9      86.1         24       25.6           4           81.4
District of Columbia  1508           41.8         17.4     83.2         38.3     44.7           5.8         100
Florida               812            5.6          12       84           22.8     26.5           3.6         93
Georgia               505            8            12.5     82.6         23.1     25.5           3.7         69.1
Hawaii                244            2.9          10.6     87.4         26.3     19.1           4.3         72.9
Idaho                 253            1.2          13.3     86.2         20       17.7           4.9         38.6
Illinois              657            7.2          10.5     85.5         27.1     21.9           4.4         84.5
Indiana               349            5.8          8.3      84.6         17.1     22.8           3.2         71.8
Iowa                  266            1.6          7.9      89.7         25.5     19.8           2.6         44.9
Kansas                389            6.3          10.5     88.1         27.3     20.2           3.7         56.8
Kentucky              295            4.8          12.5     78.7         20.5     23.2           4.1         48.4
Louisiana             681            12.5         18.5     80.8         22.5     29.3           5.5         75.2
Maine                 110            1.2          9.8      89.3         24.1     23.7           3.5         36.3
Maryland              787            8.1          7.3      85.7         32.3     24.5           3.9         92.7
Massachusetts         476            2            10.2     85.1         32.7     22.8           2.6         92.1
Michigan              555            6.7          10.2     86.2         23       24.5           3.6         82.5
Minnesota             281            3.1          7.9      90.8         31.2     19.6           3.3         70.3
Mississippi           361            9            15.5     80.3         18.7     30             5.7         36.2
Missouri              490            6.2          9.8      86.6         26.2     24.3           3.5         68
Montana               241            1.8          16       89.6         23.8     21.4           4.9         33.4
Nebraska              328            3.7          10.7     90.4         24.6     19.6           3           52.2
Nevada                524            6.5          10.1     82.8         19.3     24.2           4.1         86.6
New Hampshire         175            1.8          7.6      88.1         30.1     20             2.8         60.3
New Jersey            384            3.4          8.1      87.3         30.1     20.2           3.8         100
New Mexico            758            7.4          19.3     82.2         23.6     26.6           4.9         57
New York              554            5            14.7     82.5         28.7     26             4.6         91.9
North Carolina        498            7            13.2     79.2         23.2     24.3           3.6         67.2
North Dakota          81             0.6          12.8     85.5         22.6     19.1           3           43.4
Ohio                  334            3.7          11.1     87           24.6     24.6           4.1         80.9
Oklahoma              496            5.3          14.1     86.1         22.5     23.5           3           60.6
Oregon                351            2            12.9     88.1         27.2     22.5           4.9         72.8
Pennsylvania          420            4.9          9.8      85.7         24.3     22.8           4.2         84.5
Rhode Island          298            4.3          10.2     81.3         26.4     27.4           4.1         93.8
South Carolina        805            5.8          12       83           19       27.1           3.9         70.2
South Dakota          167            0.9          9.4      91.8         25.7     20.7           2.3         34.5
Tennessee             707            7.2          13.4     79.9         22       27.9           3.9         67.9
Texas                 545            5.9          14.9     79.2         23.9     21.5           4.2         84.6
Utah                  256            1.9          8.1      90.7         26.4     13.6           3.2         76.4
Vermont               114            1.5          10.3     90           28.8     22.5           2.9         27.9
Virginia              282            5.7          8.1      86.6         31.9     22.2           2.2         78.2
Washington            370            3.3          9.5      91.8         28.6     22.1           5.2         83
West Virginia         317            2.5          15.8     77.1         15.3     22.3           5.5         41.9
Wisconsin             237            3.2          9        86.7         23.8     21.7           3.5         67.8
Wyoming               267            2.4          11.1     90           20.6     20.8           3.9         29.6

Statewide crime data with the outlier removed (Question 4b): the same table without the District of Columbia row, leaving 50 states.
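The calculations the assignment asks for (summary statistics with skewness, Pearson's r, r-squared, and a least-squares line) can be sketched in Python as a cross-check on the Excel output. This is only an illustrative sketch, not the assignment's required method: the helper names `excel_skew`, `pearson_r`, and `fit_line` are hypothetical, the two lists are the INTERNET and GDP columns transcribed from the data above, and the skewness helper assumes the adjusted sample formula that Excel's SKEW function uses.

```python
import math
import statistics

# INTERNET and GDP columns transcribed from the nations data, in row order
# (Algeria ... Yemen). GDP is per capita, in thousands of dollars.
internet = [0.65, 10.08, 37.14, 38.7, 31.04, 4.66, 46.66, 20.14, 2.57, 42.95,
            0.93, 43.03, 26.38, 37.36, 13.21, 0.68, 1.56, 23.31, 27.66, 38.42,
            27.31, 3.62, 49.05, 46.12, 0.1, 46.38, 0.34, 2.56, 2.93, 1.34,
            6.49, 18.27, 51.63, 30.7, 6.04, 32.96, 50.15, 1.24, 0.09]
gdp = [6.09, 11.32, 25.37, 26.73, 25.52, 7.36, 27.13, 9.19, 4.02, 29,
       3.52, 24.43, 23.99, 25.35, 17.44, 2.84, 6, 32.41, 19.79, 25.13,
       8.75, 8.43, 27.19, 19.16, 0.85, 29.62, 1.89, 3.84, 7.1, 13.33,
       11.29, 20.15, 24.18, 28.1, 5.89, 24.16, 34.32, 2.07, 0.79]


def excel_skew(xs):
    """Adjusted sample skewness (assumed to match Excel's SKEW)."""
    n = len(xs)
    mean = statistics.mean(xs)
    s = statistics.stdev(xs)  # sample standard deviation (n - 1 divisor)
    return n / ((n - 1) * (n - 2)) * sum(((x - mean) / s) ** 3 for x in xs)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def fit_line(x, y):
    """Least-squares intercept a and slope b for y-hat = a + b * x."""
    mx, my = statistics.mean(x), statistics.mean(y)
    b = (sum((u - mx) * (v - my) for u, v in zip(x, y))
         / sum((u - mx) ** 2 for u in x))
    return my - b * mx, b


# Question 1a: summary statistics for each variable.
for name, col in (("INTERNET", internet), ("GDP", gdp)):
    print(name,
          "mean", round(statistics.mean(col), 4),
          "median", statistics.median(col),
          "sd", round(statistics.stdev(col), 4),
          "skew", round(excel_skew(col), 4))

# Questions 1c and 1d: correlation and coefficient of determination.
r = pearson_r(gdp, internet)
print("r =", round(r, 4), "r-squared =", round(r * r, 4))

# Same technique as Questions 3 and 4: fit a line, then predict.
a, b = fit_line(gdp, internet)  # Internet use (Y) predicted from GDP (X)
print("intercept a =", round(a, 4), "slope b =", round(b, 4))
```

For Questions 3 and 4 the same helpers would apply with the unemployment or poverty column as x and the murder rate column as y (dropping the District of Columbia row for Question 4d); the prediction at a given x is then a + b * x.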