Question: ( 2 points ) One thing we can do with this dataset is measure the 'diversity of names per year. There are two measures of
points One thing we can do with this dataset is measure the 'diversity of names per year. There are two
measures of diversity we can use:
Shannon Diversity Index: The Shannon Diversity Index is given by
where is the number of names, is the proportion of the name, and the natural logarithm.
The Shannon Diversity Index makes two important assumptions:
all names are sampled; and
the sample is random.
Simpson Diversity Index: The Simpson Diversity Index is given by
where, again, is the number of names, and is the proportion of the name.
Note that rare names would have a very low value, and so rare names will not greatly affect the value
of
The Shannon Diversity Index is a measure of 'true diversity'. The larger the number, the greater the
diversity.
The Simpson Diversity Index is actually a probability. It is the probability that if you choose two
babies born at random that they have the same name.
Given this information and the information included in the data set, which diversity index do you think is more appropriate for our data and why?
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
