Question: Machine Learning ( Regression Models ) Instructions: Write step - by - step Python code to perform all data preparation you may consider relevant for

Machine Learning (Regression Models)
Instructions: Write step-by-step Python code to perform all data preparation you may consider relevant for the given dataset; you must perform:
- An overview of the dataset.
-1 method for data inputting and/or removal of missing values
- Data transformation: 1 method for numeric variables and/or 1 method for categorical variables.
- Exploratory data analysis: In your report you should describe your more interesting findings.
* For the univariate analysis include only 3 variables.
* For bivariate or multivariate analysis include only 3 plots
* Explain the rationale for each of your selections.
-Preliminary conclusions and insights derived from your EDA.
-Refine your ML model using R2 as evaluation metric; you mst deliver:
-The JPYNB script containing ONLY ONE model (with the best result)
Notes:
>sample_test.csv can only be used for data preprocessing and testing purposes (NOT training).
>NO tune parameter optimization will be allowed during
>The training process will be allowed using the AssignmentI_data.csv file and with the same train/test split proportion employed during your research.
Note 1: The sample_test.csv file is a small sample of the AssignmentI_data.csv file. Do not draw any conclusions regarding the performance of your model using this dataset.
Note 2: The independent set to be used for the contest consists of 1163 rows.
Objective:
To predict the next-day temperature according to the given conditi
ons, using a machine learning model (regression problem). Note: It is NOT a time-series problem.
Dataset Description:
The dataset is composed of several next-day forecast variables, maximum and minimum temperatures of present-day, and geographic auxiliary variables collected for a period of 5 years by the Korean Meteorological Service over Seoul, South Korea. The output variable is the next-day average temperature (NextDayAvTemp).
Datafiles description:
*** "AssignmentI_data": csv file /6588 rows (headers included)
*** "sample_test": csv file /116 rows (headers included)
Dataset Variables:
1. Station: Used weather station number.
2. Present_Tmax : Maximum air temperature between 0 ansample_testod 21 h on the present day (\deg C).
3. Present_Tmin: Minimum air temperature between 0 and 21 h on the present day (\deg C).
4. NextDayPred_RHmin: Forecast of next-day minimum relative humidity (%).5.NextDayPred _RHmax: Forecast of next-day maximum relative humidity (%)
6. NextDayPred _Tmax_lapse: Forecast of next-day maximum air temperature applied lapse rate (\deg C)
7. NextDayPred _Tmin_lapse: Forecast of next-day minimum air temperature applied lapse rate (\deg C)
8.NextDayPred _WS: Forecast of next-day average wind speed (m/s)
9. NextDayPred _LH: Forecast of next-day average latent heat flux (W/m2)
10. NextDayPred _CC1: Forecast of next-day 1st 6-hour split average cloud cover (0-5 h)(%).
11. NextDayPred _CC2: Forecast of next-day 2nd 6-hour split average cloud cover (6-11 h)(%).
12. NextDayPred _CC3: Forecast of next-day 3rd 6-hour split average cloud cover (12-17 h)(%).
13. NextDayPred _CC4: Forecast of next-day 4th 6-hour split average cloud cover (18-23 h)(%).
14. NextDayPred _PPT1: Forecast of next-day 1st 6-hour split average precipitation (0-5 h)(%).
15. NextDayPred _PPT2: Forecast of next-day 2nd 6-hour split average precipitation (6-11 h)(%).
16. NextDayPred _PPT3: Forecast of next-day 3rd 6-hour split average precipitation (12-17 h)(%).
17. NextDayPred _PPT4: Forecast of next-day 4th 6-hour split average precipitation (18-23 h)(%).
18. Lat: Latitude (\deg ).
19. Lon: Longitude (\deg ).
20. DEM: Elevation (m).
21. Slope: Slope (\deg ).
22. Solar radiation: Daily incoming solar radiation (wh/m2).
23. NextDayAvTemp: The next-day average air temperature (\deg C).

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!