Question: The setosa data set, representing Ronald Fisher's original 50 samples from each of three species of iris ( Iris setosa , Iris virginica , and
The setosa data set, representing Ronald Fisher's original 50 samples from each of three species of iris (Iris setosa,Iris virginica, andIris versicolor), is inherently in the R software package.
Implement the k-means clustering algorithm in R, and use it on the setosa data set. Provide the percentages of each species among the data set as well as a scattergram.
Follow-Up Question:Consider a new observed plant whose specifications land on the border of the k-means classification area betweenIris setosaandIris virginica. How might we decide which species it is?
Dataset is in the excel below because it is a long data
https://drive.google.com/file/d/1iF9s7A_u1T0OSUDK4cGn2u1IJlSEnyrq/view?usp=sharing
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
