Question: Consider the (relative distance) K-means scheme for outlier detection described in Section 10.5 and the accompanying figure, Figure 10.10. (a) The points at the bottom

Consider the (relative distance) K-means scheme for outlier detection described in Section 10.5 and the accompanying figure, Figure 10.10.
(a) The points at the bottom of the compact cluster shown in Figure 10.10 have a somewhat higher outlier score than those points at the top of the compact cluster. Why?
(b) Suppose that we choose the number of clusters to be much larger, e.g., 10. Would the proposed technique still be effective in finding the most extreme outlier at the top of the figure? Why or why not?
(c) The use of relative distance adjusts for differences in density. Give an example of where such an approach might lead to the wrong conclusion.

Step by Step Solution

★★★★★

3.40 Rating (163 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

a The mean of the points is pulled somewhat upward fro... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Document Format (1 attachment)

908-M-S-D-A (8722).docx

120 KBs Word File

Students Have Also Explored These Related Statistics Questions!

In Problems 5-17 and 5-18. three different forecasts were developed for the demand for fertilizer. These three forecasts are a 3-year moving average, a weighted moving average, and a trend line....

Let Xn be a sequence of random variables that converges in distribution to a random variable X. Let Yn be a sequence of random variables with the property that for any finite number c, Show that for...

Consider the percentage change in stock price of the most active issues traded on the NASDAQ stock exchange, as shown in Table 3.9.3. In Table 3.9.3 a. Construct a histogram of this data set. b....

Consider the outlier- deleted regression model of Exercise 12.59. a. Locate the F statistic. What null hypothesis is being tested? What can we conclude based on the F statistic? b. Locate the t...

Jones & Bartlett Learning, LLC. NOT FOR RESALE OR DISTRIBUTION CHAPTER Hot Spot Analysis 10 LEARNING OBJECTIVES C A R R Provide a working definition of a \"hot spot.\" , Be able to explain different...

2. Consider the following data set consisting of four points: X1 = - [8], --[1].-= []. - [ ] Plot these points. We want to split this set to two classes. (a) Suppose you start the K-means algorithm...

Write 2 paragraphs about Macro risks and the term structure of interest rates article. No max word count, page count, or formatting requirements but has to be submit to my tutor's work as my own....

Summarize this for me Someone who understand MEA and HEA very well. And please make sure the summary is 2 pages . no less than 2 pages plz thanks Effects of temperature on mechanical properties and...

Describing Data Once we have collected data from surveys or experiments, we need to summarize and present the data in a way that will be meaningful to the reader. We will begin with graphical...

MATHEMATICS FOR MACHINE LEARNING Marc Peter Deisenroth A. Aldo Faisal Cheng Soon Ong Contents Foreword 1 Part I Mathematical Foundations 9 1 Introduction and Motivation 11 1.1 Finding Words for...

Hello, you already did chapter 1 and 2 of my MASTERS Thesis already for me. (See attached) So normally you know masters thesis consist of 5 chapters right ??..... But in this case my thesis will be 4...

Casciani Company invests $61,000 today in a savings account that earns 10% compounded annually. What will be the balance in the savings account 10 years from today (e.g., future value)?

The following data apply to the provision of psychological testing services. Sales price per unit (1 unit _ 1 test plus feedback to client).................... $ 600 Fixed costs (per month): Selling...

a . [ 1 0 marks ] Never seen 'such a complete failure' of corporate controls, says new FTX CEO who also oversaw Enron bankruptcy Manateremell f in o generawssawamces - geasmass Until recently,...

a) You deposit $800 into a savings account that pays 2.5% compounded annually. How much will you have in the account if you leave the money there for 6 years? b) You would like to have $30,000 in an...

Kent Market (KM) is a major food cooperative. Suppose KM begins 2017 with cash of $5 million. KM estimates cash receipts during 2017 will total $103 million. Planned payments will total $93 million....

Hana Ascot is planning to start a business. Identify for Hana the advantages and disadvantages of the corporate form of business organization.

Say whether each of the following scenarios describes an insurance problem caused by adverse selection or by moral hazard. a. People who have homeowners insurance are less likely than others to...

During its first month of operation, the Rawls Repair Corporation, which specializes in bicycle repairs, completed the following transactions: Oct. 1Began business by making a deposit in a company...

1.11. Based on the information in Prob. 1.10, what effect does the purchase of the equipment on account have on capital?

1. Does Steve need additional information from Iverstine and Walker? 2. What would you recommend? In 1979, Steve Blake founded Blake Electronics in Long Beach, California, to manufacture resistors,...

Ray Bond, from Problem 1-16, is trying to find a new supplier that will reduce his variable cost of production to $ 15 per unit. If he was able to succeed in reducing this cost, what would the...

What is the level of measurement for each of the following variables? a. Student IQ ratings. b. Distance students travel to class. c. The jersey numbers of a sorority soccer team. d. A students state...

Haskell Corp. is comparing two different capital structures. Plan I would result in 16,000 shares of stock and $100,000 in debt. Plan II would result in 12,000 shares of stock and $200,000 in debt....

1.Dental Diversity provides three basic service appointments; Cleaning, Check-up, and Full service. Total annual costs average $497.000. An RVU analysis indicates the following: Annual Average Appt....

What is the capital structure of this company based on market values? Bond issue #1, maturity 12 years, coupon rate 9%, current YTM 9.75%, book value $10 million Bond issue #2, maturity 16 years,...