Question: You have been provided with a dataset containing information about customer spending habits. Your task is to use the k-means algorithm with Euclidean

You have been provided with a dataset containing information about customer spending habits. Your task is to use the k-means algorithm with Euclidean distance to cluster the following 8 examples into 3 clusters:
\table[[Customer,Spending (in $)],[C1,200],[C2,150],[C3,300]]
Suppose that the initial seeds (centres of each cluster) are C1, C4, and C7. Run the k-means algorithm for 1 epoch only.
In particular:
a) Fill in the distance matrix using the Euclidean distance between the points given above:
\table[[,C1,C2,C3,C4,C5,C6,C7,C8],[C1,0,,,,,,,],[C2,,0,,,,,,],[C3,,,0,,,,,],[C4,,,,0,,,,],[C5,,,,,0,,,],[C6,,,,,,0,,],[C7,,,,,,,0,],[C8,,,,,,,,0]]
b) Calculate the cluster assignment at the end of the first epoch:
   a. The new cluster assignment (i.e., contents of each cluster)
   b. The centroids of the new clusters
c) How many more iterations are needed to converge? Show cluster assignments and updated centroids for each of the remaining epochs.
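For part a), the data has a single feature (spending), so the Euclidean distance between two customers reduces to the absolute difference of their spending values. The sketch below fills the matrix for the rows that actually appear in the table above. It is an illustration only: the values for C4 to C8 are not listed in the question as given here, so they are left out rather than guessed, and the names `spending` and `distance_matrix` are assumptions made for this example.

```python
# Only the three rows present in the table; C4..C8 are missing from the source.
spending = {"C1": 200.0, "C2": 150.0, "C3": 300.0}


def distance_matrix(values):
    """Return {(a, b): distance} for every ordered pair of labels.

    With one feature, Euclidean distance sqrt((x - y) ** 2) equals |x - y|.
    """
    return {(a, b): abs(values[a] - values[b]) for a in values for b in values}


matrix = distance_matrix(spending)
labels = list(spending)

# Print the matrix in the same row/column layout as the template.
print("    " + "".join(f"{b:>8}" for b in labels))
for a in labels:
    print(f"{a:<4}" + "".join(f"{matrix[(a, b)]:>8.0f}" for b in labels))
```

Adding the remaining customers to `spending` would extend the same matrix to the full 8 x 8 template without any other change.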
In your report, you need to include the appropriate cluster assignment and centroids.
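Parts b) and c) follow the usual two-step k-means loop: assign each point to its nearest centre, then move each centre to the mean of its members, and stop once the assignment no longer changes. The following is a minimal sketch of that loop for one-dimensional data, not the expert's hidden solution. The function names are hypothetical, and because the spending values for C4 and C7 are not shown in the table, the demonstration seeds the clusters at C1 and C3 purely for illustration.

```python
def assign(points, centroids):
    """Map each label to the index of its nearest centroid (ties go to the lowest index)."""
    return {
        label: min(range(len(centroids)), key=lambda i: abs(value - centroids[i]))
        for label, value in points.items()
    }


def update(points, assignment, centroids):
    """Recompute each centroid as the mean of its members; empty clusters keep their centre."""
    new_centroids = []
    for i, old in enumerate(centroids):
        members = [points[label] for label, c in assignment.items() if c == i]
        new_centroids.append(sum(members) / len(members) if members else old)
    return new_centroids


def run_until_stable(points, centroids, max_epochs=100):
    """Return one (assignment, centroids) pair per epoch, stopping when the assignment repeats."""
    history = []
    previous = None
    for _ in range(max_epochs):
        assignment = assign(points, centroids)
        centroids = update(points, assignment, centroids)
        history.append((assignment, centroids))
        if assignment == previous:
            break
        previous = assignment
    return history


# Illustration only: rows from the table, with hypothetical seeds C1 and C3
# (the question's seeds C4 and C7 have no values in the source).
points = {"C1": 200.0, "C2": 150.0, "C3": 300.0}
seeds = [points["C1"], points["C3"]]
history = run_until_stable(points, seeds)
for epoch, (assignment, centres) in enumerate(history, start=1):
    print(epoch, assignment, centres)
```

The first entry of `history` corresponds to part b) (assignment and centroids after one epoch). The number of further epochs asked for in part c) is `len(history) - 1`, which includes the final epoch that confirms the assignment has stopped changing.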
