In this task the idea is to study the behaviour of extreme distances and relative contrast...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
In this task the idea is to study the behaviour of extreme distances and relative contrast with different Lp norm based distance measures when the dimensionality of data increases. The relative contrast for data vector x; is defined as Ci = 2 Dmax — Dmin Dmin 2 where Dmax and Dmin are the maximum and minimum distances from vector x; to any other vector in the data, calculated with a given distance function. In the experiment, vary the dimensionality as d = 1, 2, 3, 4, 5, 10, 20,..., 100. For each d, simulate random data with n = 100 data points x; E Rd, where each component has uniform distribution in the range [0, 1]. For each x; € {x₁,...,Xn}, find the minimum, average and maximum distances from it to all other vectors. Then calculate the difference between the maximum and minimum distances and the relative contrast value. Finally compute the averages of all these five measures over all xi. Plot the average values and their logarithms as functions of increasing d. (Place all distance values in one plot and the contrast values in another plot. Similarly, do not mix the original and the logarithmic values in the same plot. You should thus have in total four plots.) Repeat the experiment four times with the following Lp measures as the distance function: L0.5, L₁, L2, and L3. a) What happens to each one of the distance values and the difference between the maxi- mum and minimum distances when d increases? Explain, how you use the original and logarithmic plots to draw these conclusions. b) What happens to the relative contrast measure when d increases? How can this finding be explained based on the distance plots? c) What kind of an effect does the p value have on the bahavior of the relative contrast measure as a function of d? d) How do you interpret the results with respect to so called "curse of dimensionality"? In this task the idea is to study the behaviour of extreme distances and relative contrast with different Lp norm based distance measures when the dimensionality of data increases. The relative contrast for data vector x; is defined as Ci = 2 Dmax — Dmin Dmin 2 where Dmax and Dmin are the maximum and minimum distances from vector x; to any other vector in the data, calculated with a given distance function. In the experiment, vary the dimensionality as d = 1, 2, 3, 4, 5, 10, 20,..., 100. For each d, simulate random data with n = 100 data points x; E Rd, where each component has uniform distribution in the range [0, 1]. For each x; € {x₁,...,Xn}, find the minimum, average and maximum distances from it to all other vectors. Then calculate the difference between the maximum and minimum distances and the relative contrast value. Finally compute the averages of all these five measures over all xi. Plot the average values and their logarithms as functions of increasing d. (Place all distance values in one plot and the contrast values in another plot. Similarly, do not mix the original and the logarithmic values in the same plot. You should thus have in total four plots.) Repeat the experiment four times with the following Lp measures as the distance function: L0.5, L₁, L2, and L3. a) What happens to each one of the distance values and the difference between the maxi- mum and minimum distances when d increases? Explain, how you use the original and logarithmic plots to draw these conclusions. b) What happens to the relative contrast measure when d increases? How can this finding be explained based on the distance plots? c) What kind of an effect does the p value have on the bahavior of the relative contrast measure as a function of d? d) How do you interpret the results with respect to so called "curse of dimensionality"?
Expert Answer:
Answer rating: 100% (QA)
The image contains text describing a task for studying the behavior of extreme distances and relative contrast with increasing dimensionality of data The task involves conducting experiments with vary... View the full answer
Related Book For
Posted Date:
Students also viewed these programming questions
-
What are the problems with the company/individual/concepts mentioned in the case study article The Secrets of Great? Why are the problems important? In what way does the problem impact the...
-
Terminology Key: Key Word List Describe Explain What to do In bulleted, numbered or paragraph form, provide a number of consecutive items-if paragraph form, use commas (,) to separate items In...
-
Diane Smith, CA, performed tests of controls and substantive tests of transactions for sales for the month of September in an audit of financial statements for the year ended 30 June 20X1. Based on...
-
Which statement best explains the chemical differences between DNA and RNA? (a) DNA has two different sugars in its sugarphosphate backbone, but RNA only has one. (b) Thymine is one of the DNA bases,...
-
Suppose a refrigerator with its door open is operated continuously in a kitchen and the kitchen is isolated from the rest of the house. The temperature of the kitchen will (a) Decrease (b) Increase...
-
Diewold Company has two departments, Milling and Assembly. The company uses a job-order cost system and computes a predetermined overhead rate in each department. The Milling Department bases its...
-
Write an algorithm/pseudo code to transpose an nxn matrix.
-
Applied overhead to Job 201 and to Job 202 using a predetermined overhead rate is 80% of direct materials cost. Transferred Job 201 to Finished Goods Inventory. (1) Sold Job 201 for $167,060 on...
-
4. Consider a Markov process with transition matrix. State 1 State 2 State 1 0 7 State 2 (a) What do the entries and 0 represent? (b) If the system is in state 1 initially, what is the probability...
-
Anna has a balance of $5,000 in her VISA account which grows at a nominal interest rate of 19.99% compounded daily. To pay off this balance, she makes monthly payments to VISA, with her first payment...
-
One of the two alternative long term debt or common stock will move Central Furniture Company to a more optimum capital structure. 1.what criteria are used to judge optimum capital structure?
-
Simplify: 8.815.82 Provide your answer below:
-
[Software Dispute] Cole, Kyra, and Morton are partners in a startup company that designs software for the biotechnology industry. Their partnership agreement contained a provision that the...
-
In Kenya growing maize requires intensive use of fertilizer. Previously, World Bank researchers have estimated that a one percent increase in the price of fertilizer reduces the supply of maize by...
-
You go out with your husband for dinner at a new restaurant you have not frequented before. It is in a part of town you rarely visit. You are shocked to see your friends spouse having dinner with a...
-
During the month, services performed for customers on account amounted to $7,500 and collections from customers in payment of their accounts totaled $6,000. At the end of the month, the Accounts...
-
Find a basis of solutions. Try to identify the series as expansions of known functions. Show the details of your work. x 2 y" + 2x 3 y' + (x 2 - 2)y = 0
-
What property of the Laplace transform is crucial in solving ODEs?
-
Using (1) or (2), find L(f) if f(t) if equals: sin 4 t. Use Prob. 19. Data from Prob. 19 sin 2 t
-
The water tank shown in Fig. 14.33 is supported by a hollow circular steel column. The tank, made of steel, is in the form of a thin-walled pressure vessel and has a capacity of 40,000 litres. Design...
-
Using the third-order Runge-Kutta method, solve Problem 11.15. Data From Problem 11.15:- Using the second-order Runge-Kutta method, solve the differential equation \(\ddot{x}+1000 x=0\) with the...
-
Linear acceleration method a. Assumes that acceleration varies linearly between \(t_{i}\) and \(t_{i}+\theta \Delta t ; \theta \geq 1\) b. Assumes that acceleration varies linearly between \(t_{i}\)...
Study smarter with the SolutionInn App