Question: Problem Description In this assignment, you will apply clustering algorithms to three toy datasets. You can try whatever clustering algorithm with whatever parameter setting. In
Problem Description
In this assignment, you will apply clustering algorithms to three toy datasets. You can try whatever clustering algorithm with whatever parameter setting. In the end, you need to report the "best" clustering output with a visualized D plot.
You can use a thirdparty library's implementation of clustering algorithms.
Your solution should include at least different clustering algorithms which could be the ones not covered in lectures for comparison.
You need to pick one validation metric eg SSE to justify the quality of the clustering output.
You should document the parameter tuning steps even if it is done manually
Input Files
We provide toy datasets Dcsv Download DcsvDcsv Download DcsvDcsv Download Dcsv for this assignment, which are all D datasets in the format of
for each line. You are encouraged to have them plot out in the first place to give you some hints on what clustering algorithms would fit.
Key Outputs
Your program should at least plot the best clustering results and specifically indicate how many clusters are found for a given dataset.
Assessment
Your program should be compilable and executable. Clear documentation on any preprocessing, parameter tuning, and validation steps.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
