Question: Hello! I have an assignment on Data mining course. Please can you write me the preprocessing code for this exercise. The file I am going
In this problem you are required to apply various clustering techniques on a given dataset Synthetic.csv, which is an artificial dataset containing 3 non-convex clusters. The dataset contains no header and each there are two attributes for each instance. a. Apply the K-means algorithm on this dataset to generate 3 clusters b. Visualize the clusters of part a. using scatter plot c. Apply DBSCAN with =0.5 and minPts=3 d. Visualize the clusters of part c. using scatter plot. e. Apply single-linkage hierarchical clustering on this dataset to generate 3 partitions f. Visualize the clusters of part e. using scatter plot g. Apply complete-linkage hierarchical clustering on this dataset to generate 3 partitions h. Visualize the clusters of part g. using scatter plot i. Apply average-linkage hierarchical clustering on this dataset to generate 3 partitions j. Visualize the clusters of part i. using scatter plot k. Briefly compare and explain the outcomes of the previous parts of this problem. Note: in all cases, during visualization dimensionality reduction is not necessary and should not be applied
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
