Question: Python Program (MUST USE MRJob) You will use MrJob to write a program, baby_names.py, that will rank baby names from a large dataset. The CSV

Python Program (MUST USE MRJob)

You will use MrJob to write a program, baby_names.py, that will rank baby names from a large dataset.

The CSV files are formatted as follows:

WY,F,1910,Mary,27 WY,F,1910,Margaret,22 WY,F,1910,Helen,13 WY,F,1910,Alice,10

The first column is the state, the second is the sex, the third is the year, then the name, and finally the count for that name/sex that year.

Your MrJob program will read CSV files formatted like those given, and compute the overall frequency for each name and will output a sorted (descending order of frequency) list of baby names, like so:

914 ["Adam", "M"] 909 ["Angela", "F"] 877 ["Megan", "F"] 

where the number is the number of times the name occcured, followed by the name and sex.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!