Question: Big Data Analysis, Academic Year 2023-2024
We have access to a database that describes different consumers by the following attributes.
Table columns: Consumer, Average Spending, Revenue, Height, Weight, Age
Question: Calculate the centroid of these data.
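The centroid is the attribute-wise mean of all consumers. A minimal sketch follows; the three rows are hypothetical placeholders (the original table values are not available here), and the names `consumers` and `centroid` are illustrative only.

```python
# Hypothetical rows, NOT the original data:
# (average spending, revenue, height, weight, age)
consumers = {
    "A": (10.0, 20.0, 170.0, 60.0, 30.0),
    "B": (20.0, 40.0, 180.0, 80.0, 50.0),
    "C": (30.0, 30.0, 160.0, 70.0, 40.0),
}


def centroid(rows):
    """Return the attribute-wise mean of a collection of equal-length rows."""
    rows = list(rows)
    # zip(*rows) regroups the values column by column.
    return tuple(sum(column) / len(rows) for column in zip(*rows))


center = centroid(consumers.values())
print(center)
```

Each coordinate of `center` is simply the average of one column, so the centroid has as many coordinates as there are attributes.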
Question: Calculate the Euclidean distance between Consumer and the centroid.
Question: Calculate the Manhattan distance between Consumer and the centroid.
Question: Calculate the Euclidean and Manhattan distances between Consumer and the centroid.
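For two points p and q, the Euclidean distance is the square root of the sum of squared coordinate differences, and the Manhattan distance is the sum of absolute coordinate differences. The sketch below applies both to a hypothetical consumer and a hypothetical centroid; the numbers are placeholders, not the values from the exercise.

```python
import math


def euclidean(p, q):
    """Euclidean (L2) distance: sqrt of the sum of squared differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def manhattan(p, q):
    """Manhattan (L1) distance: sum of absolute differences."""
    return sum(abs(a - b) for a, b in zip(p, q))


# Hypothetical values, NOT the original data:
# (average spending, revenue, height, weight, age)
consumer = (10.0, 20.0, 170.0, 60.0, 30.0)
center = (20.0, 30.0, 170.0, 70.0, 40.0)

d_euclidean = euclidean(consumer, center)
d_manhattan = manhattan(consumer, center)
print(d_euclidean, d_manhattan)
```

A consumer with a smaller distance to the centroid is closer to the "average" consumer and is, in that sense, better represented by it. Because the attributes are on different scales, the attribute with the largest numeric range tends to dominate both distances unless the data are standardized first.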
Question: Would you say that Consumer is better represented by the centroid than consumer?
Question: You want to create two clusters on this dataset. The centroid of the first cluster is given by C1 = ( ; ; ; ; ). The centroid of the second cluster is given by C2 = ( ; ; ; ; ). To which cluster, C1 or C2, will consumer be assigned? Answer using the Euclidean distance.
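Assignment follows the nearest-centroid rule: compute the Euclidean distance from the consumer to each cluster centroid and pick the smaller one. The sketch below uses hypothetical coordinates for both centroids and for the consumer, since the original values are not available; `nearest_cluster` is an illustrative helper name.

```python
import math


def euclidean(p, q):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def nearest_cluster(point, centroids):
    """Return the name of the closest centroid and all computed distances."""
    distances = {name: euclidean(point, c) for name, c in centroids.items()}
    return min(distances, key=distances.get), distances


# Hypothetical values, NOT the original data:
# (average spending, revenue, height, weight, age)
centroids = {
    "C1": (10.0, 20.0, 165.0, 60.0, 30.0),
    "C2": (30.0, 40.0, 175.0, 80.0, 50.0),
}
consumer = (12.0, 22.0, 168.0, 63.0, 33.0)

label, distances = nearest_cluster(consumer, centroids)
print(label, distances)
```

With these placeholder numbers the consumer lies much closer to the first centroid, so it would be assigned to that cluster.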
Question: How would you describe the difference between clusters C1 and C2?
Question: How does software usually initialize the centroids? Could this be a problem for the clustering results? Explain.
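Implementations commonly start from randomly chosen points or from a spread-out seeding such as k-means++, and K-means then only converges to a local optimum that depends on that start. The sketch below illustrates the issue on four hypothetical 2-D points with two hand-picked starting positions (chosen here for reproducibility rather than drawn at random); the function names are illustrative.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda j: sq_dist(p, centroids[j]))
            for p in points]


def lloyd(points, centroids, n_iter=10):
    """Run plain K-means (Lloyd) iterations from the given starting centroids."""
    centroids = list(centroids)
    for _ in range(n_iter):
        labels = assign(points, centroids)
        for j in range(len(centroids)):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    labels = assign(points, centroids)
    # Within-cluster sum of squared distances (lower is better).
    sse = sum(sq_dist(p, centroids[label]) for p, label in zip(points, labels))
    return labels, sse


# Hypothetical points: the corners of a wide rectangle.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]

labels_a, sse_a = lloyd(points, [(0.0, 0.5), (4.0, 0.5)])  # start left / right
labels_b, sse_b = lloyd(points, [(2.0, 0.0), (2.0, 1.0)])  # start bottom / top
print(labels_a, sse_a)
print(labels_b, sse_b)
```

Both runs are stable, yet the second one groups the points far less tightly. This is why the algorithm is usually run from several starts and the solution with the lowest within-cluster sum of squares is kept.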
Question: Why can combining the PCA and K-means algorithms increase the quality of the clustering?
Question: Why does the combination of PCA and K-means improve the visualization of the clustering?
Question: Describe in practice how to combine PCA and K-means.
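One common way to combine the two is: standardize the attributes, project the data onto the first few principal components, then run K-means on those component scores. The sketch below does this with NumPy only, on hypothetical data; the `kmeans` helper, the seed, and the choice of two components are assumptions made for illustration. Library routines such as scikit-learn's `StandardScaler`, `PCA` and `KMeans` are the usual tools for the same steps.

```python
import numpy as np

# Hypothetical consumers, NOT the original data:
# columns = average spending, revenue, height, weight, age
X = np.array([
    [10.0, 20.0, 160.0, 55.0, 25.0],
    [12.0, 22.0, 162.0, 58.0, 27.0],
    [11.0, 21.0, 158.0, 54.0, 24.0],
    [40.0, 60.0, 180.0, 85.0, 55.0],
    [42.0, 63.0, 183.0, 88.0, 57.0],
    [41.0, 61.0, 179.0, 84.0, 54.0],
])

# 1. Standardize so that no attribute dominates because of its unit.
Z = (X - X.mean(axis=0)) / X.std(axis=0)

# 2. PCA through the SVD of the standardized data.
U, S, Vt = np.linalg.svd(Z, full_matrices=False)
explained = S ** 2 / np.sum(S ** 2)   # share of variance per component
scores = Z @ Vt[:2].T                 # coordinates on the first two components


def kmeans(points, k, n_iter=20, seed=0):
    """Basic K-means: random distinct data points as starting centroids."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        # Distance of every point to every centroid, shape (n_points, k).
        distances = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = distances.argmin(axis=1)
        centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
    return labels, centroids


# 3. Cluster in the reduced space.
labels, centers = kmeans(scores, k=2)
print(explained.round(3))
print(labels)
```

Keeping only the leading components discards directions that carry little variance, which may reduce noise and redundancy among correlated attributes before the distances are computed. With two components, `scores` can be drawn as a scatter plot colored by `labels`, which is not possible directly with five attributes.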
