Section 5.2.1 defined a pattern distance measure between closed patterns (P_{1}) and (P_{2}) as [text { Pat_Dist

Question:

Section 5.2.1 defined a pattern distance measure between closed patterns \(P_{1}\) and \(P_{2}\) as

\[\text { Pat_Dist }\left(P_{1}, P_{2}ight)=1-\frac{\left|T\left(P_{1}ight) \cap T\left(P_{2}ight)ight|}{\left|T\left(P_{1}ight) \cup T\left(P_{2}ight)ight|}\]

where \(T\left(P_{1}ight)\) and \(T\left(P_{2}ight)\) are the supporting transaction sets of \(P_{1}\) and \(P_{2}\), respectively. Is this a valid distance metric? Show the derivation to support your answer.

Section 5.2.1

Pattern compression can be achieved by pattern clustering. Clustering techniques are described in detail in Chapters 8 and 9. In this section, it is not necessary to know the fine details of clustering. Rather, you will learn how the concept of clustering can be applied to compress frequent patterns. Clustering is the automatic process of grouping similar objects together, so that objects within a cluster are similar to one another and dissimilar to objects in other clusters. In this case, the objects are frequent patterns. The frequent patterns are clustered using a tightness measure called δ-cluster. A representative pattern is selected for each cluster, thereby offering a compressed version of the set of frequent patterns.....

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Related Book For  answer-question

Data Mining Concepts And Techniques

ISBN: 9780128117613

4th Edition

Authors: Jiawei Han, Jian Pei, Hanghang Tong

Question Posted: