Question: Please include R code is possible Question 1 (understand k-means) k-means is a relatively simple algorithm that can write by ourselves. For this question, you

Please include R code is possible Question 1 (understand k-means) k-means is a relatively simple algorithm that can write by ourselves. For this

Please include R code is possible

Question 1 (understand k-means) k-means is a relatively simple algorithm that can write by ourselves. For this question, you are not allowed to use any existing functions that perform k-means directly. Let's first generate some data. You should copy this exact code to generate the same dataset and the initial cluster assignment labels set.seed (2) n - 10 # first coordinate (uam able) of each observation x1 - rnorm(n) # second coordinate (variable) of each obser ation x2 = rnorm (n) # we also generate an initial value of the cluster assignments C sample (1:2, n, replace TRUE) ## [1] 2 1 1 1 2 2 2 1 2 2 The above code means that we consider just two clusters, and if C[1] = 1, we are currently assigning observation i to cluster 1, otherwise, its assigned to cluster 2. Hence, we can view this vector C as a cluster assignment function. To visualize the current cluster assignment, you can do the following: plot (x1, x2, col = C, pch- 19) 0.0 0.5 1.0 1.5 2.0 We know that in each iteration of the k-means algorithm, we first fix the cluster assignment function and update the cluster means mk, for k = 1,K. then, fix the cluster means and update the cluster assignment function a. [2 points] Do this iteration once, and output the new cluster assignment function (both the value of the vector C and plot it) and the cluster means for both clusters b. [2 points] Write the above two steps into a single function. Repeatedly call this function to update C and the cluster means. When they do not change anymore, stop the algorithm. You should not have an c.[2 points] Based on your final result, calculate and report the within-cluster distance of the k-mearn d. [2 points] Randomly generate another set of initial values for C and repeat the above steps. Observe if e. [2 points] Apply any clustering algorithm discussed in the lecture other than k-means on the same data excessively long output for this part. Only output the final result. algorithm, which is also the objective function used for k-means. the two runs lead to the same clustering result. Comment on your findings. set. Compare the result by using this algorithm with what you got by using k-means Question 1 (understand k-means) k-means is a relatively simple algorithm that can write by ourselves. For this question, you are not allowed to use any existing functions that perform k-means directly. Let's first generate some data. You should copy this exact code to generate the same dataset and the initial cluster assignment labels set.seed (2) n - 10 # first coordinate (uam able) of each observation x1 - rnorm(n) # second coordinate (variable) of each obser ation x2 = rnorm (n) # we also generate an initial value of the cluster assignments C sample (1:2, n, replace TRUE) ## [1] 2 1 1 1 2 2 2 1 2 2 The above code means that we consider just two clusters, and if C[1] = 1, we are currently assigning observation i to cluster 1, otherwise, its assigned to cluster 2. Hence, we can view this vector C as a cluster assignment function. To visualize the current cluster assignment, you can do the following: plot (x1, x2, col = C, pch- 19) 0.0 0.5 1.0 1.5 2.0 We know that in each iteration of the k-means algorithm, we first fix the cluster assignment function and update the cluster means mk, for k = 1,K. then, fix the cluster means and update the cluster assignment function a. [2 points] Do this iteration once, and output the new cluster assignment function (both the value of the vector C and plot it) and the cluster means for both clusters b. [2 points] Write the above two steps into a single function. Repeatedly call this function to update C and the cluster means. When they do not change anymore, stop the algorithm. You should not have an c.[2 points] Based on your final result, calculate and report the within-cluster distance of the k-mearn d. [2 points] Randomly generate another set of initial values for C and repeat the above steps. Observe if e. [2 points] Apply any clustering algorithm discussed in the lecture other than k-means on the same data excessively long output for this part. Only output the final result. algorithm, which is also the objective function used for k-means. the two runs lead to the same clustering result. Comment on your findings. set. Compare the result by using this algorithm with what you got by using k-means

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Please answer using R-code. Also note that for this question the algorithm is calculated manually, not using k-means directly Question 1 (understand k-means) k-means is a relatively simple algorithm...

python Introduction In this project, your Pac-Man agent will find paths through his maze world, to reach a particular location and (optionally) to collect food efficiently. You will build general...

Computer science help please help fill ###Your Code Here Thanks!!! QUESTION 1 def compute_arithmetic(e): """Computes the value of an arithmetic expression e.""" ### YOUR CODE HERE ## Simple tests f...

Introduction In this project, your Pac-Man agent will find paths through his maze world, to reach a particular location and (optionally) to collect food efficiently. You will build general search...

Introduction Note : Do not be intimidated by this problem! It's actually easier than it looks. We will scaffold this problem, guiding you through the creation of helper functions before you implement...

Note : Do not be intimidated by this problem! It's actually easier than it looks. We will scaffold this problem, guiding you through the creation of helper functions before you implement the actual...

MS&E 352 Decision Analysis II 03/12/2009 ____________________________________________________________________________________________ MS&E 352 Final Examination (Winter 2009) Distributed: Thursday,...

Can you please provide the answer for the question 1 MIDTERM EXAM: ADMS 4551, Winter 2010, All Sections, February 21, 2010 Student Number: ________________________ Name:...

The monthly incomes from a random sample of faculty at a university are shown below. Monthly Income ($1000s) 3.0 4.0 6.0 3.0 5.0 5.0 6.0 8.0 Using a T-Test, Compute a 90% confidence interval for the...

1. Owners of small businesses located nearby. 2. Town residents, and residents of nearby towns. Walmart is one of the largest corporations in the world, and it has obviously enjoyed tremendous...

The future value of $1,500 per year for 10 years at 8% \f

Which of the following are problems with identifying users of ABC? Multiple select question. ABC means different things to different organizations. Organizations will announce the discontinuance of...

Control (labelled behaviour in previous work): learners prefer to rely on their own initiative, or on the direction of an instructor. This has implications for the level of control required or...

Cognitive: learners are more subjective in the way they make decisions and solve problems based on personal judgements, or base their decision making more on logic and scientific approaches. This...

Receptivity: learners are predominantly receptive to practical stimuli or theoretical stimuli for learning depending on their cultural backgrounds and their experiences in national educational...