Question: We want to cluster categorical data, i . e . data that have categorical attribute domains. The k - medoid algorithm can be applied to
We want to cluster categorical data, ie data that have categorical attribute domains. The medoid
algorithm can be applied to any datasets with a given pairwise distance function and, therefore, is
applicable also to categorical data. The means algorithm, on the other hand, is much more efficient
than the medoid algorithm, but it requires numeric data. The task of this assignment is to develop
an analogion to the means algorithm for categorical data. We assume the following distance
function for pairs of categorical objects:
dist with
a What is the analogion for the means of a cluster for categorical data? Note that must be
computable by scanning the set of objects of once similar to the computation of the cluster
means marks
Hint: If a concept is in analogy another concept then is said to be an analogion for
Hint: an analogion for the means of a cluster refers to a new definition of the means of a
cluster of categorical data, where the original definition of cluster centre used by means does
not work anymore.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
