Question: The first objective is to combine those files and stack them as three large files, one for each time period. Run basic EDA and descriptive statistics on some columns and clean any obvious outliers from each time period. Make sure that no more than 1% of the data are removed from within each time period in this process. Clearly write the details of outlier detection and descriptive analysis.

Ans. Given the data files from 2010-2020, we categorized the data into three time periods: 2010-2012, 2014-2016, and 2018-2020. Using the column names in data_header, we made sure all the data files have consistent column names, and then used rbind to merge the individual files into one data set per time period.
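The stacking and cleaning steps described in the answer could look roughly like the following R sketch. It is an illustration only, not the original solution: the helper names stack_period and remove_outliers, the two-column contents of data_header, the simulated yearly data, and the 3 x IQR fence are all assumptions made here. The only parts taken from the answer are the three time periods, the use of shared column names, and rbind. Because the question caps removals at 1% per period, the sketch drops at most floor(0.01 * n) rows and keeps only the most extreme flagged values when more are flagged.

```r
# Hypothetical helper: give every yearly data frame the same column names,
# then stack the frames into one data frame with rbind.
stack_period <- function(frames, col_names) {
  renamed <- lapply(frames, function(df) {
    stopifnot(ncol(df) == length(col_names))
    names(df) <- col_names
    df
  })
  do.call(rbind, renamed)
}

# Hypothetical outlier rule: flag values beyond k * IQR from the quartiles,
# but never drop more than max_frac of the rows in the period.
remove_outliers <- function(df, column, k = 3, max_frac = 0.01) {
  x <- df[[column]]
  q <- unname(quantile(x, c(0.25, 0.75), na.rm = TRUE))
  iqr <- q[2] - q[1]
  flagged <- which(x < q[1] - k * iqr | x > q[2] + k * iqr)
  budget <- floor(max_frac * nrow(df))
  if (length(flagged) > budget) {
    # Over budget: keep only the values farthest from the median.
    dist <- abs(x[flagged] - median(x, na.rm = TRUE))
    flagged <- flagged[order(dist, decreasing = TRUE)][seq_len(budget)]
  }
  if (length(flagged) == 0) df else df[-flagged, , drop = FALSE]
}

# Simulated stand-in for the yearly files (the real files are not shown).
set.seed(1)
make_year <- function(year, n = 100) {
  data.frame(V1 = rep(year, n), V2 = rnorm(n, mean = 50, sd = 5))
}

data_header <- c("year", "value")  # assumed contents of data_header
period_years <- list(
  "2010-2012" = 2010:2012,
  "2014-2016" = 2014:2016,
  "2018-2020" = 2018:2020
)

# One stacked data frame per time period.
periods <- lapply(period_years, function(yrs) {
  stack_period(lapply(yrs, make_year), data_header)
})

# Plant one obvious outlier so the cleaning step has something to remove.
periods[["2010-2012"]]$value[1] <- 1e6

# Descriptive statistics before cleaning, then the cleaning itself.
summary(periods[["2010-2012"]]$value)
cleaned <- lapply(periods, remove_outliers, column = "value")

# Check the 1% cap for every period.
removed_frac <- 1 - sapply(cleaned, nrow) / sapply(periods, nrow)
stopifnot(all(removed_frac <= 0.01))
summary(cleaned[["2010-2012"]]$value)
```

With real data, the make_year call would be replaced by reading each yearly file, and the outlier rule would be applied to each numeric column of interest, with the removed fraction reported per period as part of the write-up.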
