Question:
Data Preprocessing task:
Examine the data: its shape, column types, and so on. Check for missing values and remove them if there
are any. Drop the categorical variables and scale the data.
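The preprocessing steps above can be sketched in pandas as follows. This is a minimal illustration, not a reference solution: the small table and its column names (name, mfr, calories, sugars, fiber) are made up to stand in for the cereal data, and z-score standardization is only one assumed choice of scaling.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data standing in for the cereal file.
# The column names and values are assumptions for illustration only.
df = pd.DataFrame({
    "name": ["A", "B", "C", "D", "E"],
    "mfr": ["K", "G", "K", "P", "G"],
    "calories": [70, 120, 110, np.nan, 100],
    "sugars": [6, 8, 5, 0, 10],
    "fiber": [10.0, 2.0, 1.0, 3.0, 0.0],
})

# Examine the data: shape, column types, missing values per column.
print(df.shape)
print(df.dtypes)
print(df.isna().sum())

# Remove rows that contain any missing value.
clean = df.dropna()

# Drop the categorical (non-numeric) columns.
numeric = clean.select_dtypes(include="number")

# Scale each column to mean 0 and standard deviation 1 (z-score).
scaled = (numeric - numeric.mean()) / numeric.std(ddof=0)
print(scaled.round(2))
```

With the real data, the toy DataFrame would be replaced by the loaded file; the remaining lines should carry over unchanged.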
Clustering Task:
Apply hierarchical clustering to the normalized measurements, using Euclidean distance.
Compare the dendrograms from single linkage and complete linkage. Choose the better method and
make clusters. Examine the cluster profiles (centroids).
Hint: To obtain cluster centroids for hierarchical clustering, compute the average values of each cluster's
members, using groupby with the cluster labels followed by mean:
dataframe.groupby(clusterlabel).mean()
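One way to compare the two linkage methods is to draw both dendrograms side by side with SciPy. The sketch below assumes the scaled measurements are already in a numeric array; the random two-group data is made up so that the example runs on its own.

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

# Made-up data: two loose groups of 6 points in 3 dimensions.
# In the assignment this would be the scaled cereal measurements.
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(0.0, 0.3, size=(6, 3)),
    rng.normal(3.0, 0.3, size=(6, 3)),
])

# Agglomerative clustering with Euclidean distance, two linkage rules.
Z_single = linkage(X, method="single", metric="euclidean")
Z_complete = linkage(X, method="complete", metric="euclidean")

# Draw the two dendrograms next to each other for comparison.
fig, axes = plt.subplots(1, 2, figsize=(10, 4))
dendrogram(Z_single, ax=axes[0])
axes[0].set_title("Single linkage")
axes[0].set_ylabel("Merge distance")
dendrogram(Z_complete, ax=axes[1])
axes[1].set_title("Complete linkage")
fig.tight_layout()
```

In a Jupyter notebook the figure renders when the cell finishes. Single linkage tends to produce long chains of points joining one at a time, while complete linkage tends to produce more compact groups, which is usually what to look for when deciding between the two.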
Question:
How many clusters would you use? What distance is used for this cutoff?
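A common heuristic for this question, offered here as an assumption rather than the required method, is to cut the dendrogram inside the largest gap between successive merge distances. The sketch below uses made-up data with three groups; the cutoff and the resulting cluster count come from the linkage matrix.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Made-up data: three tight groups of 5 points around well-separated centers.
rng = np.random.default_rng(1)
centers = np.array([[0.0, 0.0], [4.0, 0.0], [2.0, 3.5]])
X = np.vstack([rng.normal(c, 0.2, size=(5, 2)) for c in centers])

Z = linkage(X, method="complete", metric="euclidean")

# Column 2 of the linkage matrix holds the merge distances,
# which are non-decreasing for complete linkage.
heights = Z[:, 2]
gaps = np.diff(heights)
i = int(np.argmax(gaps))  # largest jump is between merge i and merge i + 1

# Place the cutoff in the middle of that jump.
cutoff = (heights[i] + heights[i + 1]) / 2

# Flat clusters: merges above the cutoff distance are not applied.
labels = fcluster(Z, t=cutoff, criterion="distance")
n_clusters = len(np.unique(labels))
print(f"cutoff distance = {cutoff:.2f}, clusters = {n_clusters}")
```

The value of `cutoff` is the distance to report, and `n_clusters` is the number of branches the horizontal cut line crosses in the dendrogram. A visual reading of the dendrogram should agree with it; if it does not, the visual judgment is a reasonable tiebreaker.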
Question: The elementary public schools would like to choose a set of cereals to offer in their
cafeterias. Every day a different cereal is offered, but all cereals should support a healthy diet.
For this goal, you are asked to find a cluster of "healthy cereals."
Make clusters on the basis of your answer to Q
Make cluster profiles
Choose the cluster that has the healthiest cereals for children in your opinion based on the
level of ingredients.
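Cluster profiles are easiest to read in the original (unscaled) units. The sketch below builds them with groupby and mean, then ranks clusters with a purely illustrative score. The column names, the values, and the scoring rule (more fiber and protein, less sugar) are all assumptions; the actual choice should rest on your own reading of the profiles.

```python
import pandas as pd

# Hypothetical unscaled nutrient values with cluster labels already assigned.
df = pd.DataFrame({
    "sugars":  [12, 13, 11, 3, 2, 4],
    "fiber":   [0.0, 1.0, 0.5, 9.0, 10.0, 8.0],
    "protein": [1, 2, 1, 4, 4, 3],
    "cluster": [1, 1, 1, 2, 2, 2],
})

# Cluster profiles: mean of every variable within each cluster.
profiles = df.groupby("cluster").mean()
sizes = df.groupby("cluster").size()
print(profiles)
print(sizes)

# Illustrative "healthiness" score (an assumption, not a standard):
# standardize the profiles across clusters, reward fiber and protein,
# penalize sugars.
z = (profiles - profiles.mean()) / profiles.std(ddof=0)
score = z["fiber"] + z["protein"] - z["sugars"]
healthiest = score.idxmax()
print("Cluster with the highest score:", healthiest)
```

A score like this is only a starting point for the Markdown discussion; other ingredients in the data may matter just as much and can be added to or removed from the rule.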
Incorporate your answers in the Jupyter notebook, using Markdown cells.
Run the Jupyter notebook from start to finish, save it in HTML format, and submit it in the
assignments space.
