The accompanying table contains a portion of data from the National Longitudinal Survey (NLS), which follows over
Question:
The accompanying table contains a portion of data from the National Longitudinal Survey (NLS), which follows over 12,000 individuals in the United States over time. Variables in this analysis include Urban (1 if lives in urban area, 0 otherwise), Siblings (number of siblings), White (1 if white, 0 otherwise), Christian (1 if Christian, 0 otherwise), FamilySize, Height, Weight (in pounds), and Income (in $).
a. Use hierarchical clustering to group the respondents. Use Gower’s coefficient for the distance between observations and Ward’s clustering method to cluster the individuals into four clusters. How many individuals are in the largest cluster?
b. Describe the characteristics of the clusters.
Step by Step Answer:
Business Analytics Communicating With Numbers
ISBN: 9781260785005
1st Edition
Authors: Sanjiv Jaggia, Alison Kelly, Kevin Lertwachara, Leida Chen