Question: it's regarding machine learning Problem 8. (20 points) Understanding the curse of dimensionality. Consider the following experiment: generate n data points with dimensionality k. Let

 it's regarding machine learning Problem 8. (20 points) Understanding the curse

it's regarding machine learning

Problem 8. (20 points) Understanding the curse of dimensionality. Consider the following experiment: generate n data points with dimensionality k. Let cach data point be generated using a uniform random number generator with values between 0 and 1. Now, for a given k, calculate dmax (k) - dmin (k) r(k) = 10510 - dmin(k) where dmax() is the maximum distance between any pair of points and dmin(k) is minimum distance between any pair of points (you cannot use identical points to obtain the minimum distance of O). Let k take cach value from {1,2,..., 99, 100}. Repeat cach experiment multiple times to get stable values by averaging the quantities over multiple runs for cach k. a) (15 points) Plot r(k) as a function of k for two different values of n; n 100, 1000). Label and scale cach axis properly to be able to make comparisons over different n's. Embed your final picture(s) in the file you are submitting for this assignment. b) (5 points) Discuss your observations and also compare the results to your expectations before you carried out the experiment. Problem 8. (20 points) Understanding the curse of dimensionality. Consider the following experiment: generate n data points with dimensionality k. Let cach data point be generated using a uniform random number generator with values between 0 and 1. Now, for a given k, calculate dmax (k) - dmin (k) r(k) = 10510 - dmin(k) where dmax() is the maximum distance between any pair of points and dmin(k) is minimum distance between any pair of points (you cannot use identical points to obtain the minimum distance of O). Let k take cach value from {1,2,..., 99, 100}. Repeat cach experiment multiple times to get stable values by averaging the quantities over multiple runs for cach k. a) (15 points) Plot r(k) as a function of k for two different values of n; n 100, 1000). Label and scale cach axis properly to be able to make comparisons over different n's. Embed your final picture(s) in the file you are submitting for this assignment. b) (5 points) Discuss your observations and also compare the results to your expectations before you carried out the experiment

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!