Question: We have a data set with 6 records as 2 - dimensional points, p 1 to p 6 . These points locations in a 2
We have a data set with records as dimensional points, p to p These points locations in a dimensional grid and their corresponding distance matrix are given below.
Using this data we want to come up with a means clustering and, doing so we would like
to avoid the initial centroids problem. For this purpose, we choose to implement the K
means algorithm. Suppose the first two centroids are p and p respectively. Derive the
probability distribution that will be used to randomly select the third centroid ie calculate the probabilities of each point to be picked as the third centroid Which point is most likely to be picked? Show all your calculations. pts
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
