Question: I would like some help understanding and solving these questions with the correct lines of code by relating to this data set in PYTHON. There
I would like some help understanding and solving these questions with the correct lines of code by relating to this data set in PYTHON. There are a lot more cells, but I just included the first 20 so I could understand it and apply it to the rest of them. Any help would be much appreciated!
1. Import the dataset and read it. There are missing values in Age, Cabin, and Embarked. Calculate the proportion of missing values in each. Plot a histogram for Age and replace the missing values with the column's average.
2. Add + Calculate a new variable to the data: Fare per person
3. Encode all the categorical variables.
4. Run a Logistic regression model, kNN, and SVM with Age, Sex, Fare per person, pclass and parch . State your response variable.
5. Which model performed the best?

Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
