Question: Linear regression. on Diamonds Problem Description: Dataset is: https://www.kaggle.com/shivam2503/diamonds It contains unnamed:0, carat, cut, color, clarity, depth, table, price, x, y and z columns here
Linear regression. on Diamonds
Problem Description:
Dataset is: https://www.kaggle.com/shivam2503/diamonds
It contains unnamed:0, carat, cut, color, clarity, depth, table, price, x, y and z columns
here we have to find the metrics
Approach:
- Identify the columns with missing values (1 point). Fill the missing values with mean value for numerical attributes and mode value for categorical attributes.
- Visualise the dataset.
- Split the data into training set and testing set. Perform 10-fold cross validation.
- Train a Linear regression model for the dataset.
- Display the coefficients and form the linear regression equation.
- Compute the accuracy and confusion matrix.
- Plot the decision boundary.
Instructions
- Follow the instructions in each question carefully.
- Python code from Jupyter notebook along with output for each cell is expected.
- Any assignment submitted using other python IDEs are not considered for grading.
- Use appropriate labels for all visualizations.
- Upload the output.csv file along with the notebook when required.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
