Question: You will be using the dataset als_data.csv to apply clustering methods for this assignment. The dataset contains anonymized data on ALS patients. With this data, complete the following steps:
(1) Remove any data that is not relevant to the patient's ALS condition.
(2) Apply a standard scaler to the data.
(3) Create a plot of the cluster silhouette score versus the number of clusters in a K-means clustering.
(4) Use the plot created in (3) to choose an optimal number of clusters for K-means. Justify your choice.
(5) Fit a K-means model to the data with the optimal number of clusters chosen in part (4).
(6) Fit a PCA transformation with two components to the scaled data.
(7) Make a scatterplot of the PCA-transformed data, coloring each point by its cluster value.
(8) Summarize your results and make a conclusion.
Step by Step Solution
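The steps in the question can be sketched with pandas and scikit-learn roughly as follows. This is a minimal sketch under stated assumptions, not a verified solution: the identifier column names `ID` and `SubjectID` are guesses about what als_data.csv contains, the helper names (`prepare`, `silhouette_by_k`, `cluster_and_project`, `plot_results`) are hypothetical, and dropping non-numeric columns and rows with missing values is one possible cleaning choice among several.

```python
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler


def prepare(df, id_columns=("ID", "SubjectID")):
    """Drop identifier columns (names are assumed) and standardize the rest."""
    # Identifiers carry no clinical information; missing names are ignored.
    features = df.drop(columns=list(id_columns), errors="ignore")
    # Assumption: keep numeric columns only and drop rows with missing values.
    features = features.select_dtypes(include="number").dropna()
    return StandardScaler().fit_transform(features)


def silhouette_by_k(X, k_values=range(2, 11), seed=0):
    """Return {k: mean silhouette score} for K-means fits over k_values."""
    scores = {}
    for k in k_values:
        labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(X)
        scores[k] = silhouette_score(X, labels)
    return scores


def cluster_and_project(X, k, seed=0):
    """Fit K-means with k clusters and project X onto two principal components."""
    labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(X)
    coords = PCA(n_components=2).fit_transform(X)
    return labels, coords


def plot_results(scores, labels, coords):
    """Draw the silhouette curve and the cluster-colored PCA scatterplot."""
    # Imported lazily so the analysis functions do not require matplotlib.
    import matplotlib.pyplot as plt

    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(11, 4))
    ax1.plot(list(scores), list(scores.values()), marker="o")
    ax1.set_xlabel("Number of clusters (k)")
    ax1.set_ylabel("Mean silhouette score")
    ax2.scatter(coords[:, 0], coords[:, 1], c=labels, cmap="viridis", s=10)
    ax2.set_xlabel("PC 1")
    ax2.set_ylabel("PC 2")
    fig.tight_layout()
    return fig
```

A typical run would call `X = prepare(pd.read_csv("als_data.csv"))`, then `scores = silhouette_by_k(X)`, pick a value such as `k = max(scores, key=scores.get)`, and pass the output of `cluster_and_project(X, k)` to `plot_results`. Taking the k with the highest mean silhouette score is a common rule of thumb for step (4), since the score rewards tight, well-separated clusters; the justification should still be checked against the shape of the plotted curve. Note that the clusters are fit on all scaled features, while PCA is used only to display them in two dimensions, so clusters may overlap visually in the scatterplot even when they are separated in the full feature space.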
