Question: Please follow the steps below to complete your assignment utilizing the Titanic _ dataset.csv file. Review the variable descriptors file for the dataset. Exploration and
Please follow the steps below to complete your assignment utilizing the Titanicdataset.csv file.
Review the variable descriptors file for the dataset.
Exploration and Initial Analysis:
a Load the Titanic dataset and print the first rows.
b Display the shape and descriptive statistics of the dataset.
c Check and list the data types of each column.
Data Cleaning:
Identify and handle missing values in the dataset.
a Remove rows with missing values in Age column
b Impute missing values in Embarked column with the mode.
Subset and Dimension Reduction:
a Select relevant columns from the cleaned dataset using the following column names: PassengerId, Pclass, Name, Age, Sex, Survived or column numbers: through
b Create a subset of the first rows for reviewing.
Analysis and Reporting:
a Count the unique values in the passenger class 'Pclass' column from the output from step a
b Generate value counts for other categorical variables.
c Take a random sample of the dataset and report the sample size.
Submission
Save your Jupyter Notebook file with the name TitanicDataPreparationAssignment.ipynb. and upload back onto the Blackboard Assignment
Ensure that all code cells are properly commented to explain each step.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
