Question: Linear regression. on Diamonds Problem Description: Dataset is: https://www.kaggle.com/shivam2503/diamonds It contains unnamed:0, carat, cut, color, clarity, depth, table, price, x, y and z columns here

Linear regression. on Diamonds

Problem Description:

Dataset is: https://www.kaggle.com/shivam2503/diamonds

It contains unnamed:0, carat, cut, color, clarity, depth, table, price, x, y and z columns

here we have to find the metrics

Approach:

  1. Identify the columns with missing values (1 point). Fill the missing values with mean value for numerical attributes and mode value for categorical attributes.
  2. Visualise the dataset.
  3. Split the data into training set and testing set. Perform 10-fold cross validation.
  4. Train a Linear regression model for the dataset.
  5. Display the coefficients and form the linear regression equation.
  6. Compute the accuracy and confusion matrix.
  7. Plot the decision boundary.

Instructions

  1. Follow the instructions in each question carefully.
  2. Python code from Jupyter notebook along with output for each cell is expected.
  3. Any assignment submitted using other python IDEs are not considered for grading.
  4. Use appropriate labels for all visualizations.
  5. Upload the output.csv file along with the notebook when required.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!