Question: Q 1 : Homework Consolidation 2 0 0 pts Write a program to consolidate ALL your homeworks done to date ( leave out the LinkedIN
Q : Homework Consolidation pts
Write a program to consolidate ALL your homeworks done to date leave out the LinkedIN one
In essence, you should have homeworks. The user should be able to enter the homework number,
and the homework runs as it is expected to Once the homework run is complete, we should come back
to the original menu, where the user can select another homework or type to exit
Note: Do Error Handling in menu inputs, eg not allowing any input other than
Sample Menu
: BMI
: While Question String
: FindRemainder Function
: Lists with
: First Second File to Third
: Dictionary List
: Pandas Basics
: Stats & Pandas
: Data Visualization
: Address Class
Please Enter Homework Number and to exit:
: Train Dataset pts
You have to devise an Object Oriented solution for this problem, ie you MUST create a CLASS, and
solve the problem calling the class methods.
The train dataset contains data regarding a train crash. It has ~ records. However, all records are not
complete, ie some have missing data. In this data set, our outcome of interest is the survived column,
on whether an individual survived or died after the Train crash.
Clean the data and remove any unwanted records. How many records do you have now?
points
The train picked most passengers from which station
points
Do some basic data exploration eg using commands as head info describe nunique etc
Which variables will you NOT select?
points
Are there any outliers in the data? If yes, treat them.
points
Partition the data into a training set with of the observations and testing set with of the
observations using the random state of for cross validation. points
On the partitioned data, build the best KNN model. Show the accuracy numbers. Hint: What is the
best value of k How do you decide the best k
points
On the partitioned data, build the best logistic regression model. Show the accuracy numbers.
points
On the partitioned data, build the decision tree. Show the accuracy numbers. What tree depth did
you choose, ie which one is ideal and why? points
Based on the results of knearest neighbor, and logistic regression, what is the best model to classify
the data? Provide explanation to support your argument. points
Show some interesting graphs of the data, ie that can describe the original data.
points
Requirements
Make all the "best programming practice" decisions, eg how to show the output, what prompts
to display, how to ask for input etc.
You are the developerengineer it is YOUR decision YOUR job. If the program is not presented
nicely, you will lose points.
First, all that is being asked should be done. Second, the displays should be intuitive, selfexplanatory and nicely put.
Do NOT assume that the user knows ANYTHING.
Write the program such that any new person that sits in front of the terminal, can start playing
with it ie the commands, displays etc. are adequate
Submission and Demo
MULTIPLE file submissions are allowed. Anything you submit is kept.
You will submit the following:
ipynb files for all questions, clearly labeled. QHWipynb and QTrain.ipynb
You are asked to demo the project INclass or Zoom. Details in DLEmail
The Demo files ie files you submit should NOT be commented. python code for this with the detail
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
