Question: Question 1 [10 points] Suppose that you have been given a text file unis.csv containing lines of comma- separated data about some universities, and how
Question 1 [10 points]
Suppose that you have been given a text file unis.csv containing lines of comma- separated data about some universities, and how their graduates report on outcomes from the education. Here are the data fields (also called data attributes):
Field name Description UniName Abbreviation of the Universitys name
State
Abbreviation of the state where the University is mostly located
| Employment(2018) | Percentage of 2018 graduates in full-time employment, three months after graduation |
| Employment(2019) | Percentage of 2019 graduates in full-time employment, three months after graduation |
The first few lines look like this (note that the first line is a header, and also note that the fields do not themselves contain any commas): UniName,State,Employment(2018),Employment(2019) CQU,QLD,79.1,79.6
Curtin,WA,72.4,71.4 Deakin,VIC,72.8,73.4
Suppose that you are part of a team whose task is to analyse the data in unis.csv to calculate the following: place the values for Employment(2018) into bins, with each bin representing a range of 5 (for example, 70 to 75, 75 to 80 , etc). For each bin, find how many states contain universities whose Employment(2018) score is within that bin.
Provide well-commented Python code that will perform this calculation. You do not need to deal with misformatted files or other errors. You are allowed to use a library like Pandas, but this is not required. It is important that your comments should clearly describe the structure used for storing the data in your program (eg if you use a dictionary, you must explain what the keys and values represent; if you use Pandas, you must indicate the indices of the dataframes your code refers to, etc).
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
