Question: Consider the (relative distance) K-means scheme for outlier detection described in Section 10.5 and the accompanying figure, Figure 10.10. (a) The points at the bottom
Consider the (relative distance) K-means scheme for outlier detection described in Section 10.5 and the accompanying figure, Figure 10.10.
(a) The points at the bottom of the compact cluster shown in Figure 10.10 have a somewhat higher outlier score than those points at the top of the compact cluster. Why?
(b) Suppose that we choose the number of clusters to be much larger, e.g., 10. Would the proposed technique still be effective in finding the most extreme outlier at the top of the figure? Why or why not?
(c) The use of relative distance adjusts for differences in density. Give an example of where such an approach might lead to the wrong conclusion.
Step by Step Solution
3.40 Rating (163 Votes )
There are 3 Steps involved in it
a The mean of the points is pulled somewhat upward fro... View full answer
Get step-by-step solutions from verified subject matter experts
Document Format (1 attachment)
908-M-S-D-A (8722).docx
120 KBs Word File
