Question: Machine learning, Problem 8 (20 points): Understanding the curse of dimensionality

This question is about machine learning.
Problem 8. (20 points) Understanding the curse of dimensionality. Consider the following experiment: generate n data points with dimensionality k. Let each data point be generated using a uniform random number generator with values between 0 and 1. Now, for a given k, calculate

$$r(k) = \log_{10} \frac{d_{\max}(k) - d_{\min}(k)}{d_{\min}(k)}$$

where $d_{\max}(k)$ is the maximum distance between any pair of points and $d_{\min}(k)$ is the minimum distance between any pair of points (you cannot use identical points to obtain a minimum distance of 0). Let k take each value from {1, 2, ..., 99, 100}. Repeat each experiment multiple times to get stable values by averaging the quantities over multiple runs for each k.

a) (15 points) Plot r(k) as a function of k for two different values of n: n ∈ {100, 1000}. Label and scale each axis properly to be able to make comparisons over different n's. Embed your final picture(s) in the file you are submitting for this assignment.

b) (5 points) Discuss your observations and also compare the results to your expectations before you carried out the experiment.
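The experiment described above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not a reference solution: the problem does not name a distance metric, so Euclidean distance is assumed; "averaging the quantities" is read here as averaging r(k) itself over the runs; and the names `log_ratio`, `mean_log_ratio`, `runs`, and `seed` are made up for illustration. It uses only the standard library, so it is slow for n = 1000; a vectorized array library would be the practical choice there.

```python
import math
import random
from itertools import combinations


def log_ratio(n, k, rng):
    """One run: r(k) = log10((d_max - d_min) / d_min) for n uniform points in [0, 1]^k.

    Assumes Euclidean distance and n >= 3, so that d_max > d_min in practice.
    """
    points = [[rng.random() for _ in range(k)] for _ in range(n)]
    d_min, d_max = math.inf, 0.0
    for p, q in combinations(points, 2):
        d = math.dist(p, q)
        if d == 0.0:
            continue  # identical points may not supply the minimum distance
        d_min = min(d_min, d)
        d_max = max(d_max, d)
    return math.log10((d_max - d_min) / d_min)


def mean_log_ratio(n, k, runs=10, seed=0):
    """Average r(k) over several independent runs (assumed reading of 'averaging')."""
    rng = random.Random(seed)
    return sum(log_ratio(n, k, rng) for _ in range(runs)) / runs
```

Evaluating `mean_log_ratio(100, k)` and `mean_log_ratio(1000, k)` for k = 1, ..., 100 would give the two curves for part (a), which can then be drawn on shared axes with any plotting tool. For part (b), one would expect r(k) to fall as k grows, since pairwise distances between uniform points concentrate around a common value in high dimensions, so the nearest and farthest pairs become relatively similar.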
