Question:

10.4 For the k-means algorithm, it is interesting to note that by choosing the initial cluster centers carefully, we may be able to not only speed up the algorithm's convergence, but also guarantee the quality of the final clustering. The k-means++ algorithm is a variant of k-means, which chooses the initial centers as follows. First, it selects one center uniformly at random from the objects in the data set. Then, iteratively, it chooses a new center from among the objects p that have not yet been chosen. This object is chosen at random with probability proportional to dist(p)^2, where dist(p) is the distance from p to the closest center that has already been chosen. The iteration continues until k centers are selected. Explain why this method will not only speed up the convergence of the k-means algorithm, but also guarantee the quality of the final clustering results.
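The seeding procedure described in the question can be sketched in a few lines of Python. This is only an illustrative sketch, not code from the textbook or the solution: the names kmeanspp_init and squared_distance are hypothetical, points are assumed to be tuples of numbers, and the sampling weight is assumed to be the squared distance to the nearest chosen center, as in the usual k-means++ formulation.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeanspp_init(points, k, rng=None):
    """Pick k initial centers from `points` with k-means++ style seeding.

    Hypothetical helper for illustration; assumes squared-distance weights.
    """
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    rng = rng or random.Random()

    # Step 1: the first center is drawn uniformly at random.
    centers = [rng.choice(points)]

    # d2[i] is the squared distance from points[i] to its closest chosen center.
    d2 = [squared_distance(p, centers[0]) for p in points]

    while len(centers) < k:
        if sum(d2) == 0:
            # Every point coincides with a chosen center; fall back to uniform.
            idx = rng.randrange(len(points))
        else:
            # Step 2: sample an index with probability proportional to d2.
            # Already chosen points have weight 0 and cannot be drawn again.
            idx = rng.choices(range(len(points)), weights=d2, k=1)[0]
        centers.append(points[idx])

        # Step 3: refresh each point's distance to its closest center.
        d2 = [min(old, squared_distance(p, points[idx]))
              for p, old in zip(points, d2)]
    return centers
```

For example, kmeanspp_init([(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)], 2) returns two of the four points. Because far-away objects receive larger weights, the second center is much more likely to come from the group that does not contain the first center, which is the spreading-out effect the question asks about.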
Step by Step Solution
The k-means++ initialization will not only speed up the convergence of the k-means algorithm but also guarant...
