Question: In this project, your task is to perform feature engineering and feature scaling ( including transformations like logarithmic scaling ) on our dataset to enhance
In this project, your task is to perform feature engineering and feature scaling including transformations like logarithmic scaling on our dataset to enhance the performance of a linear regression model. You have complete freedom to decide how many features to include, which features to use, and in what formats to present them for the linear regression model.
Please ensure the following:
You are required to manually create at least five new features that are different from those used in this lecture.
If you decide to apply feature scaling eg robust scaling be careful to avoid any data leakage.
Feel free to explore polynomial features and utilize feature selection techniques.
Use fold crossvalidation with a random seed of as shown in the notebook, to run and validate a linear regression model. Report its performance across all five folds, along with the averages. The performance metrics should include Rsquared, RMSE, and MAPE.
Your grade will mainly depend on the model's performance, especially the average Rsquared value. This value will then be scaled to determine the final score for the project. Your model should at least outperform the benchmark of but ideally, it should exceed which is the highest Rsquared value in this notebook.
Since the dataset split and model are predetermined, the results and overall performance will rely entirely on the feature engineering and scaling techniques you implement. Ultimately, Ultimately, this project aims to enhance your understanding of the crucial role these tasks play in model performance.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
