Question: Let's build some basic reports on demographic information about US residents. Get the dataset from the UC Irvine Machine Learning Repository:
https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data
Write Python scripts using the pandas library to complete the following tasks in a Jupyter Notebook. For publication-quality figures and data visualization, use the Matplotlib and/or seaborn libraries:
1. How many men and women (sex feature) are represented in this dataset?
2. What is the average age (age feature) of women?
3. What is the percentage of German citizens (native-country feature)?
4. Make a population histogram (bar plot) of people's education (education feature). What are the mean and standard deviation of age for those who earn more than 50K per year (salary feature)?
5. What are the mean and standard deviation of age for those who earn less than 50K per year?
6. Is it true that people who earn more than 50K have at least a high school education? (That is, is their education feature always one of Bachelors, Prof-school, Assoc-acdm, Assoc-voc, Masters, or Doctorate?)
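The tasks above can be sketched as small pandas helpers. This is a minimal sketch, not a verified solution: it assumes the data has already been read into a DataFrame `df` whose columns carry the feature names used in the question (`sex`, `age`, `native-country`, `education`, `salary`) and whose string values have no leading spaces. All function names here are hypothetical.

```python
import pandas as pd

# Education levels named in task 6 of the question.
HIGHER_EDUCATION = {
    "Bachelors", "Prof-school", "Assoc-acdm",
    "Assoc-voc", "Masters", "Doctorate",
}


def sex_counts(df: pd.DataFrame) -> pd.Series:
    """Number of rows for each value of the sex feature."""
    return df["sex"].value_counts()


def mean_age_of_women(df: pd.DataFrame) -> float:
    """Average age over rows where sex is 'Female'."""
    return df.loc[df["sex"] == "Female", "age"].mean()


def german_share_percent(df: pd.DataFrame) -> float:
    """Percentage of all rows whose native-country is 'Germany'.

    Rows with a missing native-country count as non-German here.
    """
    return (df["native-country"] == "Germany").mean() * 100


def age_stats_by_salary(df: pd.DataFrame) -> pd.DataFrame:
    """Mean and sample standard deviation of age per salary group."""
    return df.groupby("salary")["age"].agg(["mean", "std"])


def high_earners_all_higher_education(df: pd.DataFrame) -> bool:
    """True only if every '>50K' row has one of the listed education levels."""
    high = df[df["salary"] == ">50K"]
    return bool(high["education"].isin(HIGHER_EDUCATION).all())


def plot_education_counts(df: pd.DataFrame):
    """Bar plot of how many people fall into each education level."""
    counts = df["education"].value_counts()
    ax = counts.plot(kind="bar", figsize=(10, 4))  # pandas draws this with Matplotlib
    ax.set_xlabel("education")
    ax.set_ylabel("number of people")
    ax.set_title("People per education level")
    return ax
```

In a notebook, each helper can be called in its own cell, for example `sex_counts(df)` or `age_stats_by_salary(df)`. The `std` reported by pandas is the sample standard deviation (`ddof=1`); pass a custom aggregation if the population value is wanted instead.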
Note:
The dataset has fifteen (15) features (columns): age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, native-country, and salary. The last column (salary) has the values <=50K or >50K.
Details about the list of features (columns) can be found here:
https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.names
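One way to read the file into pandas is sketched below. It assumes, based on the column list in the note, that the data file has no header row, that fields are separated by a comma followed by a space, and that missing values are written as `?`. The helper name `load_adult` and the constant names are hypothetical, and `salary` is used for the last column because that is what the question calls it.

```python
import pandas as pd

# URL given in the question.
DATA_URL = "https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data"

# Column names in the order listed in the note; "salary" is the last column.
COLUMNS = [
    "age", "workclass", "fnlwgt", "education", "education-num",
    "marital-status", "occupation", "relationship", "race", "sex",
    "capital-gain", "capital-loss", "hours-per-week", "native-country",
    "salary",
]


def load_adult(source=DATA_URL) -> pd.DataFrame:
    """Read the Adult data from a URL, a local path, or a file-like object."""
    return pd.read_csv(
        source,
        header=None,            # assumption: the file has no header row
        names=COLUMNS,
        skipinitialspace=True,  # drop the space that follows each comma
        na_values="?",          # assumption: "?" marks a missing value
    )


# Usage in a notebook cell (downloads the file):
# df = load_adult()
# df.head()
```

Stripping the space after each comma matters for the tasks above: without it, comparisons such as `df["sex"] == "Female"` would silently match nothing, because the stored values would be `" Female"`.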
