Question: You are working with a data set consisting of 7 continuous variables and 500 observations. You wish to group the observations together as you explore

You are working with a data set consisting of 7 continuous variables and 500 observations. You wish to group the observations together as you explore the data set to help identify interesting patterns.

You decide to use KMeans clustering to group similar observations together. Why should you consider standardizing the variables before running the KMeans clustering algorithm? Select all that apply.

a. Variables with small natural levels of variation will not contribute to the distance calculation

b. Variables with large natural variation will dominate the distance calculation

c. So that you can visualize the histograms together

d. So that you can visualize the boxplots together

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!