Question: Senior Project and Seminar
Please answer the following in Python and do not copy and paste from previous questions. The train.csv file has already been downloaded.

In this assignment, you will work on the Microsoft Malware Prediction problem. The data can be downloaded from Kaggle; please make sure to download the train.csv file: https://www.kaggle.com/c/microsoft-malware-prediction/data

First, as part of preparing and understanding the data, you need to write code to do the following:
1) Read a small sample, such as 1000 records, using the following code: pd.read_csv("train.csv", nrows=1000)
2) Examine the data types of the variables
3) Show the top 5 rows of the data frame
4) Encode string values (if any) to integers
5) Once again, examine the data types of the variables
6) Produce some histograms of the variables
7) Provide an analysis of the missing value percentage in each variable. You can use the following code: Panda_dataframe.isnull().sum()
8) Show the total number of missing values in all variables using the following code:
#The sum of the missing values in each variable
dataset.isnull().sum()
Panda_dataframe.isnull().sum().sum()

Please review the German credit code to work on the assignment.
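
One possible way to approach the steps listed above is sketched below. This is a minimal sketch, not the German credit code the assignment refers to. It assumes train.csv is in the working directory; the helper names encode_strings, missing_report and main are hypothetical, as is the choice to plot only the first 12 columns so the figure stays readable. Strings are encoded with pd.factorize, which marks missing strings as -1, so the missing-value analysis is run on the raw sample rather than on the encoded copy.

```python
import os

import pandas as pd
from pandas.api.types import is_object_dtype, is_string_dtype


def encode_strings(df):
    """Return a copy of df with every string column replaced by integer codes."""
    out = df.copy()
    for col in out.columns:
        if is_object_dtype(out[col]) or is_string_dtype(out[col]):
            # factorize numbers the distinct values 0, 1, 2, ... in order of
            # first appearance; missing values receive the code -1.
            codes, _ = pd.factorize(out[col])
            out[col] = codes
    return out


def missing_report(df):
    """Return the count and percentage of missing values for each column."""
    report = pd.DataFrame({
        "missing": df.isnull().sum(),
        "percent": df.isnull().mean() * 100,
    })
    return report.sort_values("percent", ascending=False)


def main(path="train.csv"):
    # 1) Read a small sample of 1000 records.
    df = pd.read_csv(path, nrows=1000)

    # 2) Data types before encoding, 3) top 5 rows.
    print(df.dtypes)
    print(df.head())

    # 4) Encode string values to integers, 5) data types after encoding.
    encoded = encode_strings(df)
    print(encoded.dtypes)

    # 6) Histograms of a subset of columns (assumption: the first 12).
    import matplotlib.pyplot as plt
    encoded.iloc[:, :12].hist(figsize=(14, 10), bins=20)
    plt.tight_layout()
    plt.show()

    # 7) Missing value percentage per variable, taken from the raw sample
    #    because the encoding above turns missing strings into -1.
    print(missing_report(df))

    # 8) Total number of missing values across all variables.
    print("Total missing values:", int(df.isnull().sum().sum()))


if __name__ == "__main__" and os.path.exists("train.csv"):
    main()
```

Calling main() with a different path runs the same steps on another CSV file. If the integer codes need to carry meaning (for example, ordered categories), an explicit mapping per column would be a better fit than factorize.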
