Question: You will be using the dataset als_data.csv to apply clustering methods for this assignment. The dataset contains anonymized data on ALS patients. With this data, complete the following steps:

1. Remove any data that is not relevant to the patient's ALS condition.
2. Apply a standard scaler to the data.
3. Create a plot of the cluster silhouette score versus the number of clusters in a K-means model.
4. Use the plot created in (3) to choose an optimal number of clusters for K-means. Justify your choice.
5. Fit a K-means model to the data with the optimal number of clusters chosen in part (4).
6. Fit a PCA transformation with two components to the scaled data.
7. Make a scatterplot of the PCA-transformed data, coloring each point by its cluster value.
8. Summarize your results and make a conclusion.
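The steps in the question can be sketched roughly as follows. This is a minimal illustration, not a verified solution: the helper names (silhouette_by_k, cluster_and_project, plot_results) and the "ID" column are hypothetical, and the demo runs on synthetic data because the contents of als_data.csv are not shown here. Picking the k with the highest mean silhouette score is one common heuristic; the assignment still asks you to justify the choice from the plot.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler


def silhouette_by_k(X, k_values, seed=0):
    """Return {k: mean silhouette score} for K-means fits on X."""
    scores = {}
    for k in k_values:
        labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(X)
        scores[k] = silhouette_score(X, labels)
    return scores


def cluster_and_project(df, id_cols=(), k_values=range(2, 11), seed=0):
    """Drop irrelevant columns, scale, choose k by silhouette, fit K-means and PCA."""
    # Step 1: drop columns unrelated to the condition (assumed to be identifiers).
    features = df.drop(columns=list(id_cols)).select_dtypes(include="number").dropna()
    # Step 2: standardize every feature to zero mean and unit variance.
    X = StandardScaler().fit_transform(features)
    # Steps 3-4: silhouette score for each candidate k; keep the best-scoring k.
    scores = silhouette_by_k(X, k_values, seed)
    best_k = max(scores, key=scores.get)
    # Step 5: final K-means fit with the chosen number of clusters.
    labels = KMeans(n_clusters=best_k, n_init=10, random_state=seed).fit_predict(X)
    # Step 6: project the scaled data onto its first two principal components.
    coords = PCA(n_components=2).fit_transform(X)
    return scores, best_k, labels, coords


def plot_results(scores, coords, labels):
    """Steps 3 and 7: silhouette-vs-k curve and PCA scatterplot colored by cluster."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for plotting

    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(11, 4))
    ax1.plot(list(scores.keys()), list(scores.values()), marker="o")
    ax1.set_xlabel("Number of clusters (k)")
    ax1.set_ylabel("Mean silhouette score")
    ax2.scatter(coords[:, 0], coords[:, 1], c=labels, cmap="viridis", s=12)
    ax2.set_xlabel("PC 1")
    ax2.set_ylabel("PC 2")
    fig.tight_layout()
    plt.show()


# Synthetic stand-in for the real data: three separated groups plus an
# identifier column. "ID" and f1..f4 are made-up names, not the real schema.
rng = np.random.default_rng(0)
centers = np.array([[0, 0, 0, 0], [8, 8, 0, 0], [0, 8, 8, 8]])
values = np.vstack([c + rng.normal(size=(50, 4)) for c in centers])
demo = pd.DataFrame(values, columns=["f1", "f2", "f3", "f4"])
demo.insert(0, "ID", np.arange(len(demo)))

scores, best_k, labels, coords = cluster_and_project(demo, id_cols=["ID"])
print("chosen k:", best_k, "| PCA output shape:", coords.shape)
```

To try this on the actual file, replace the synthetic frame with pd.read_csv("als_data.csv") and pass whichever identifier or otherwise irrelevant columns the file really contains as id_cols, then call plot_results(scores, coords, labels) to draw both figures. Which columns count as irrelevant to the ALS condition is a judgment to make after inspecting the data.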
