Question: Problem 2 (13 marks) File: Titanic. xlsx The data set contains personal information for 891 passengers, including an indicator variable for their survival, and the
Problem 2 (13 marks) File: Titanic. xlsx
The data set contains personal information for 891 passengers, including an indicator variable for their survival, and the objective is to predict survival, or probability thereof, from the other characteristics. The survival data for all passengers is stored in the binary variable calledSurvived. The predictors includeSex(modeled with male/female dummy variables),Age(and additional dummy variables for ranges),Class(first, second, or third, modeled with dummy variables),SiblingSpouse(number of siblings and spouses accompanying the passenger, and corresponding dummy variables),ParentChild(number of parents and children accompanying the passenger, and corresponding dummy variables), andEmbarked(ports of Cherbourg, QueensTown, and Southampton, modeled by dummy variables)
y=1 if the passenger was survived and y=0 if not
a.Write the logistic regression equation relating Class and Survived.
(2 mark)
b.For the Titanic data, use SAS to compute the estimated logistic regression equation. (3 marks)
c.What is the interpretation of when (2 marks)
d.Estimate the probability of surviving the 2nd class passengers and the 3rd class passengers. (3 marks)
e.What is the estimated odds ratio? What is the interpretation? (3 marks)
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
