Two different data sets are provided (eas508_sp23_exam1-data1.csv and eas508_sp23_exam1-data2.csv)which will provide two different challenges. There is no
Question:
Two different data sets are provided ("eas508_sp23_exam1-data1.csv" and "eas508_sp23_exam1-data2.csv")which will provide two different challenges. There is no relationship between the data sets or the properties, and you should consider them as totally separate. (80 points - 40 points for each data set)
For each data set, (i) predict the missing property values (those denoted by a '?'). You need to justify your predictions. Justifications for your predictions should include comparison between different approaches and model accuracy and robustness. Also, describe the steps that you employed prior to the regression step, and why, including which features you included and why. You should provide enough information that your prediction can be reproduced - do not assume that it is obvious the steps you followed. The regression approaches are limited to linear regression, multiple linear regression, and principal component regression. Note: I have excel sheet(.csv file), I am not able to upload so attached partial screenshots