Question: Suppose that we have the following data: a b c d e f g h i j (2,0) (1,2) (2,2) (3,2) (2,3) (3,3) (2,4) (3,4)
Suppose that we have the following data: a b c d e f g h i j (2,0) (1,2) (2,2) (3,2) (2,3) (3,3) (2,4) (3,4) (4,4) (3,5)
Identify the cluster by applying the k-means algorithm, with k = 2. Try using initial cluster centers as far apart as possible.
Show that the ratio of the between-cluster variation to the within-cluster variation decreases with each pass of the algorithm.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
