Question: Problem 2 [40 pts] [K-Means] Given the matrix x whose rows represent different data points, you are asked

Problem 2
[40 pts] [K-Means] Given the matrix $x$ whose rows represent different data points, you are asked to perform a k-means clustering on this dataset using the Euclidean distance as the distance function. Here $k$ is chosen as 3. The Euclidean distance $d$ between a vector $x = [x_1, x_2, \dots, x_p]^T$ and a vector $y = [y_1, y_2, \dots, y_p]^T$, both in $\mathbb{R}^p$, is defined as

$$d = \sqrt{\sum_{i=1}^{p} (x_i - y_i)^2}.$$

All data in $x$ were plotted in Figure 1 (black, red, blue, and green points). We randomly choose the following 3 points as the initialized centers of 3 clusters, which are $\mu_1 = (5.8, 3.0)$ (red), $\mu_2 = (6.2, 3.0)$ (blue), $\mu_3 = (6.5, 3.5)$ (green).

$$x = \begin{bmatrix} x_1^T \\ x_2^T \\ x_3^T \\ x_4^T \\ x_5^T \\ x_6^T \\ x_7^T \\ x_8^T \end{bmatrix} = \begin{bmatrix} 4.0 & 3.5 \\ 3.5 & 3.0 \\ 5.8 & 3.0 \\ 5.8 & 2.5 \\ 6.5 & 3.5 \\ 5.0 & 4.0 \\ 5.5 & 3.5 \\ 6.2 & 3.0 \end{bmatrix}$$
Figure 1: Scatter plot of dataset and the initialized centers of 3 clusters.
(a)[10 pts
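As a rough illustration of the setup described in the problem statement, the following is a minimal sketch of k-means with Euclidean distance, using the eight data points and the three initial centers given there. It assumes the standard alternation of an assignment step and a mean-update step, which the problem text does not spell out. The names `euclidean`, `assign`, `update`, and `kmeans` are hypothetical helpers introduced here for illustration, and the choices to break ties toward the lower-indexed center and to leave an empty cluster's center unchanged are assumptions as well.

```python
import math

# Data points (rows of x) and initial centers, copied from the problem statement.
X = [(4.0, 3.5), (3.5, 3.0), (5.8, 3.0), (5.8, 2.5),
     (6.5, 3.5), (5.0, 4.0), (5.5, 3.5), (6.2, 3.0)]
INIT_CENTERS = [(5.8, 3.0), (6.2, 3.0), (6.5, 3.5)]  # red, blue, green


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def assign(points, centers):
    """Assignment step: index of the nearest center for each point.

    Assumption: ties go to the lower-indexed center (behavior of min).
    """
    return [min(range(len(centers)), key=lambda k: euclidean(p, centers[k]))
            for p in points]


def update(points, labels, centers):
    """Update step: each center becomes the mean of its assigned points.

    Assumption: a center with no assigned points is left unchanged.
    """
    new_centers = []
    for k, old in enumerate(centers):
        members = [p for p, lab in zip(points, labels) if lab == k]
        if not members:
            new_centers.append(old)
            continue
        new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
    return new_centers


def kmeans(points, centers, max_iter=100):
    """Alternate assignment and update until the labels stop changing."""
    labels = None
    for _ in range(max_iter):
        new_labels = assign(points, centers)
        if new_labels == labels:
            break
        labels = new_labels
        centers = update(points, labels, centers)
    return labels, centers


if __name__ == "__main__":
    first_labels = assign(X, INIT_CENTERS)
    print("labels after first assignment:", first_labels)
    print("centers after first update:", update(X, first_labels, INIT_CENTERS))
    labels, centers = kmeans(X, INIT_CENTERS)
    print("final labels:", labels)
    print("final centers:", centers)
```

Labels are zero-based here, so label 0 corresponds to the red center, 1 to blue, and 2 to green. Calling `assign` and `update` once each reproduces a single iteration by hand, while `kmeans` repeats them until the assignment no longer changes.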
