Question: Problem Description In this assignment, you will apply clustering algorithms to three toy datasets. You can try whatever clustering algorithm with whatever parameter setting. In

Problem Description
In this assignment, you will apply clustering algorithms to three toy datasets. You can try whatever clustering algorithm with whatever parameter setting. In the end, you need to report the "best" clustering output with a visualized 2D plot.
You can use a third-party library's implementation of clustering algorithms.
Your solution should include at least 3 different clustering algorithms (which could be the ones not covered in lectures) for comparison.
You need to pick one validation metric (e.g., SSE) to justify the quality of the clustering output.
You should document the parameter tuning steps (even if it is done manually).
Input Files
We provide 3 toy datasets (D01.csv, Download D01.csv,D02.csv, Download D02.csv,D03.csv Download D03.csv) for this assignment, which are all 2D datasets in the format of
for each line. You are encouraged to have them plot out in the first place to give you some hints on what clustering algorithms would fit.
Key Outputs
Your program should at least plot the best clustering results and specifically indicate how many clusters are found for a given dataset.
Assessment
Your program should be compilable and executable. Clear documentation on any preprocessing, parameter tuning, and validation steps.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!