Question: Question 2 (30 points) We focus on the following subset of the variables: regime, oil, logGDPcp, and illit. Remove observations that have missing values in

Question 2 (30 points) We focus on the following subset of the variables: regime, oil, logGDPcp, and illit. Remove observations that have missing values in any of these variables. Using the scale () function, scale these variables so that each variable has a mean of zero and a standard deviation of one. Fit the k-means clustering algorithm with two clusters. . How many observations are assigned to each cluster? . Using the original unstandardized data, compute the means of these variables in each cluster and report them. Question 3 (20 points) Using the clusters obtained above, modify the scatterplot between logged GDP per capita and illiteracy rate in the following manner. Use different colors for the clusters so that we can easily tell the cluster membership of each observation. In addition, make the size of each circle proportional to the oil variable so that oil-rich countries stand out (hint: adjust the cex parameter). Briefly comment on the results. Question 4 (20 points) Repeat the previous two questions but this time with three clusters instead of two. How are the results different? Which clustering model would you prefer and why

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!

The accompanying data file contains the data from the National Longitudinal Survey (NLS), which follows over 12,000 individuals in the United States over time. Variables in this analysis include the...

The best definition for nonparametric statistics I can think of is when data does not fit inside the normal parameters of a normal distribution, it calls for the use of this statistical method....

Hello, I would like a presentation to be made for 2 sections from the attached article! DOES THE CHOICE OF EXCHANGE RATE REGIME AFFECT THE ECONOMIC GROWTH OF DEVELOPING COUNTRIES? Glauco De Vita...

Instuctor's Annotated Edition TENTH EDITION Understandable Statistics Concepts and Methods Charles Henry Brase Regis University Corrinne Pellillo Brase Arapahoe Community College Australia Brazil...

Set Student Name: 1. Describe the relationship between two variables that have a correlation coefficient value: a. Near -1 b. Near 0 c. Near 1 2. Data was collected where a weightlifter was asked to...

ID Salary 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 59.3 26.5 34.1 56.3 46.8 79.5 41.2 23.6 76 22.7 23.8 62.9 41.4 22.3 23.1 46.6 67.6 33.8 24 35 75.2 51.5 22.1 55.4 24.9 24.3 42.9 76.6 75.4...

Week 2: Understanding and Exploring Assumptions You will submit one Word document, including your SPSS output. 1. Why do we care whether the assumptions required for statistical tests are met? (Tip:...

Set Week Three During this week, we will look at ways of testing multiple (more than two) data samples at the same time. We will continue to use the data and assignment file that we opened in Week 2,...

MKT572 Group Project2 There are several requirements for group project. Failure to meet the submission requirement will result in a deduction of 10 points from your grade. 1. In your cover page, list...

1. What is the issue being addressed in the paper? 2. What are the findings of the paper? 3. Why is this paper important to auditors, and what are the implications of this paper for the auditing...

Define the following terms. Owner Registrable instrument Beacons Demarcation

After teaching a class on game theory, your instructor announces that if every student skips the last question on the next exam, everyone will receive full credit for that question. However, if one...

The following information has been taken from the accounting records of EH Corporation for last year: Selling expenses P 70,000 Raw materials inventory, January 1 45,000 Raw materials inventory,...

Compute the amount of gross profit to be recognized each year, assuming the percentage-of-completion method is used. Gross profit recognized in 2025 Gross profit recognized in 2026 \$ Gross profit...