What are the distinct values of 'Year' within the data? What command did you use to determine
Question:
What are the distinct values of 'Year' within the data? What command did you use to determine this?
A statement that creates a new dataframe that only has people named 'DAVID':
val davids =
A statement that calculates the total number of people named 'DAVID' by year (across all counties). This means you need to add together the different value per country in the same year:
val davidsByYear =
A statement that calculates the total number of people with the same name and gender by year:
val sameName =
A statement that determines the maximum number of times a name was used per gender, per year:
val maxNameCounts =
A statement that joins maxNameCounts with sameName. Note that you have TWO fields to join on (Year and Sex):
val sameNameWithMaxCount =
A statement that filters the results such that we have the most used names per year/sex:
A series of statements that, starting from reading in CSV file, determines the most popular name per year across both sexes. Determine this not by directly comparing counts, but by the % of people given that name for that gender.
Concepts of Database Management
ISBN: 978-1285427102
8th edition
Authors: Philip J. Pratt, Mary Z. Last