Question: ( 2 points ) One thing we can do with this dataset is measure the 'diversity of names per year. There are two measures of

(2 points) One thing we can do with this dataset is measure the 'diversity of names per year. There are two
measures of diversity we can use:
Shannon Diversity Index: The Shannon Diversity Index is given by
H'=-i=1Npiln(pi)
where N is the number of names, pi is the proportion of the ith name, and ln the natural logarithm.
The Shannon Diversity Index makes two important assumptions:
all names are sampled; and
the sample is random.
Simpson Diversity Index: The Simpson Diversity Index is given by
=i=1Npi2
where, again, N is the number of names, and pi is the proportion of the ith name.
Note that rare names would have a very low pi value, and so rare names will not greatly affect the value
of .
The Shannon Diversity Index is a measure of 'true diversity'. The larger the number, the greater the
diversity.
The Simpson Diversity Index is actually a probability. It is the probability that if you choose two
babies born at random that they have the same name.
Given this information and the information included in the data set, which diversity index do you think is more appropriate for our data and why?
 (2 points) One thing we can do with this dataset is

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!